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ISOLATED DENTRITIC CELL MEMBRANE PROTEIN GENES^ 



This filing claims priority from Provisional U.S. Patent 
5 Application USSN 60/053,080, filed July 9, 1997, which is incprporated 
hereiri by reference. 

FIELD OF THE INVENTION 
The present invention contemplates compositions related to 
10 genes found in lymphocytes, e.g., cells which function in the^ immune 
system. These genes are useful markers, and may function in 
controlling development, differentiation, and/or physiology of the 
mammalian immune system. In particular, the application provides 
nucleic acids, proteins, antibodies, and methods of usiiig them. 

15 

BACKGROUND OF THE INVENTION 
The circulating component of the mammalian circulatory system 
comprises various cell types, including red and white blood cells of the 
erjrthroid and myeloid cell lineages. See, e.g., Rapaport (1987) 
20 Introduction to Hematology (2d ed.) Lippincott, Philadelphia, PA; Jandl 
(1987) Blood: Textbook of Hematolog y. Little, Brown and Co., Boston, 
MA.; and Paul (ed.) (1993) Fundamental Ipimu^ology (3d ed.) Raven 
Press, N Y. 

Dendritic cells (DC) are antigen-processing or presenting cells, and 
25 are found in all tissues of the body. See Steinman (1991) Annual 

Review of Immunolog y 9:271-296; and Banchereau and Schmitt (eds. 
1994) Dendritic Cells in Fundamental and Clinical Immunology Plenum 
Press, NY. These DC can be classified into various categories, including: 
interstitial dendritic cells of the heart, kidney, gut, and lung; Langerhans 
30 cells in the skin and mucous membranes; interdigitating dendritic cells 
in the thymic medula and secondary lymphoid tissue; and blood and 
lymph dendritic cells. Although dendritic cells in each of these 
compartments are CD45+ leukocytes that apparently arise from bone 
marrow, they may exhibit differences that relate to maturation state and 
35 microenvironment. 

These dendritic cells efficiently process and present antigens to, 
e.g., T cells. They stimulate responses from naive and memory T cells 
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Hie primaiy and secondary B^cell fbUicles contain follicular 
dendritic cells that trap and retain intact antigen ais immune complexes for 
long periods of time. These dendritic cells present native antigen to B cells 
and are likely to be involved in the tiie affinity maturation of. antibodies/ 
5 the geheratipn of immune memory, and the maintenance of humoral 
immune responses. 

Monocytes are phagocytic cells that belong to the mpnonudear 
phagocyte system and reside in the circulation. See Roitt (ed) Encyclopedia: 
of Immunolopv Academic Press, San Diego. These cells originate in the 

10 bone marrow and remain only a short time in the marrow compartment 
once they differentiate. They tihen enter the circulation and can remain 
tiiere for a relatiyely long period of time, e.g., a few days. The nionocytes 
can enter the tissues and body cavities by the process designated 
diapedesis, where they differentiate into macrophages and possibly into 

15 dendritic cells. In an inflammatory response, the number of monocytes in 
the circulation may double, and many of the increased number of 
monocytes diapedese to the site of inflammation. 

Antigen presentation refers to the cellular events in which a 
proteinaceous antigen is taken up, processed by antigen presenting cells 

20 (APC), and then recognized to initiate an immune response. The most 
active antigen presenting cells have been characterized as the 
macrophages, which are direct developmental products from monocytes; 
dendritic cells; and certain B cells. 

Macrophages are found in most tissues and are highly active in 

25 internalization of a wide variety of protein antigens and microorganisms. 
They have a highly developed endocytic activity, and secrete many 
products important in the iiutiation of an immune response. For this 
reason, it is believed that many genes expressed by monocytes or induced 
by monocyte activation are likely to be important in antigen uptake, 

30 processing, presentation, or regulation of the resulting immune response. 

However, dendritic cells and monocytes are poorly characterized, , 
both in terms of proteins they express, and many of their functions and 
mechanisms of action, including their activated states. In particular, the 
processes and mechanisms related to the initiation of an immune response, 

35 including antigen pocessing and presentation, remain unclear. The 

absence of knowledge about tiie structural, biological, and physiological 
properties of these cells Umits their understanding. Thus, medical 
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conditions where regtitation^ deveippment^ or physiology of antigen 
presenting cells is unusual remain unmanagea^^^ 

SUMMARY OF THE INVENTiON 
5 The present inventioin is based, in part, upon the .discovery of 

various mammalian Dendritic CeU Membrane Protein (DCMP) genes, 
exemplified by the specific DCMPl and DCMP2 embodiments. 
Distribution data indicates a broader cellular distribution/ and structural 
data suggests some function. The DCMPl exhibits similarity to a class of 

10 lectins arid asialoglycoprotein receptors. The DCMP2 embodiments 
described exhibit significant sequence similarity to a macrophage cell 
asialoglycdprpteih receptor. The invention embraces agonists and 
antagonists of the gene products, e.g., mutations (muteins) of the natural 
sequences, fusion proteins, cheniical mimetics, antibodies, and other 

15 structural or functional analogs. It is also directed to isolated genes 

encoding proteins of the invention. Various uses of tiiese different protein 
or nucleic acid composition are also provided. 

In particular embodiments, the invention provides a binding 
compound comprising an antibody binding site which specifically binds to 

20 a DCMPl protein; or a polypeptide selected from: Gly Val Ser Glu Leu Gin 
Glu His Thr Thr Gin Lys Ala His Leu Gly His Cys Pro His Cys Pro Ser Val 
Cys Val Pro (residues 118-144 of SEQ ID NO: 4); Gin Val Ala Thr Leu Asn 
Asn Asn Ala Ser Thr Glu Gly Thr Cys Cys (residues 166-181 of SEQ ID NO: 
4); or Trp Lys Pro Gly Gin Pro Asp Asn Trp Gin Gly His Gly Leu Gly 

2 5 (residues 263-277 of SEQ ID NO: 4). In preferred embodiments, in tiie 

binding compound, the antibody binding site is: specifically 
immunoreactive with a protein of SEQ ID NO: 2 or 8; specifically 
immunoreacfave with a protein of residues 118 to 144 of SEQ ID NO: 4; 
raised against a purified or recombinantiy produced human DCMPl 
30 protein; raised against a purified or recombinantiy produced human 

protein comprising sequence of residues 118 to 144 of SEQ ID NO: 4; in a 
monoclonal antibody. Fab, or F(ab)2; or the binding compoimd is: 
detectably labeled; sterile; or in a buffered composition. 

The invention embraces methods using those binding compounds, 

3 5 comprising contacting the binding compound with a biological sample 

comprising an antigen to form a binding compound:antigen complex. In 
certain embodiments, the biological sample is human, and the binding 



SUBSTITUTE SHEET (RULE 26) 



wo 99/02562 PeTAJS98/13436 



compottiid is an antibody. The invention also proVideis a[ det^ctipn kit 
comprising such binding compound and: instmctional materisd for i^^^ 
of such binding compound for the detection; or a compartment providing 
segregation of the binding compound. 
5 The invention also provides a substantially pure or isolated 

polypeptide, which specifically binds to such binding compounds. In 
various embodiments^ the polypeptide: comprises at least a fragment of at 
least 14 amino acid residues from a primate DCMPl protein; comprises at 
least 14 amino acids of residues 118 to 144 of SEQ ID NO: 4; is a soluble 

10 polypeptide; is detectably labeled; is in a sterile composition; is in a 

buffered composition; binds to an sialic acid residue; is recombinantly 
produced, or has a naturally occurring polypeptide sequence* 

Nucleic acid embodiments are provided, including a nucleic acid 
encoding a polypeptide above> when purified. Often, the nucleic acid: 

15 comprises at least 30 nucleotides of the coding portion of SEQ ID NO: 1 or 
7; comprises at least 30 nucleotides from nucleotides 608-688 of SEQ ID 
NO: 3; or comprises at least 30 nucleotides from nucleotides 752-799 of SEQ 
ID NO: 3, or it may comprise an insert which selectively hybridizes to a 
nucleic acid encoding a polypeptide defined above. The invention also 

20 provides a cell transfected with such a nucleic acid, e.g., which consists of 
the protein encoding portions of SEQ ID NO: 1, 7, or the appropriate 
portions of SEQ ID NO: 3. 

The invention provides methods using at least one strand of those 
nucleic acids to form a duplex nucleic acid, comprising a step of contacting 

25 such strand to a sample to a complementary strand capable of specifically 
hybridizing. In preferred embodiments, the method allows detection of 
the duplex; or allows histological localization of the duplex. 

Alternatively, the invention provides methods of using a described 
binding composition, comprising a step of contacting the binding 

30 composition with a sample to form a binding composition:antigen 

complex. In preferred embodiments, the sample is a biological sample, 
including a body fluid; the antigen is on a cell; or the antigen is further 
purified. 

The invention further embraces methods using those polypeptides, 
35 comprising contacting the polypeptide with a sample to form a binding 
composition:polypeptide complex. In preferred embodiments, the 
polypeptide is further purified. 
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Another method provided is to modulating dendritic cell 
physiology or function comprising a step of contacting the cell with: a 
binding composition as decribed; a DCMPl protein as described; or a 
polypeptide as decribed. The function may also restdt in initiation or 
5 progression of an immune response. T5T)icaily/the contacting is in 

combination with an antigen, including a cell surfece, MHC Glass I, or 
MHC Class n antigen. 

DETAILED DESCRIPTION 

10 

All references cited herein are incorporated herein by reference to 
the same extent as if each individual publication or patent application was 
specifically and individually indicated to he incorporated by reference in 
its entirety for all purposes. 

15 

L General 

The present invention provides DNA sequences encoding 
mammalian proteins expressed on dendritic cells (DQ. For a review of 
dendritic cells, see Steinman (1991) Annual Review of Immunology 9:271- 

20 296; and Banchereau and Schmitt (eds. 1994) Dendritic Cells in 

Fundamental and Clinical Immunology Plenum Press, NY. These proteins 
are designated dendritic cell proteins because they are found on these cells 
and appear to exhibit some specificity in their expression. 

Specific primate, e.g., hiunan, embodiments of these proteins are 

25 provided below. Rodent, e.g., mouse, coimterparts also exist The 

descriptions below are directed, for exemplary purposes, to the human DC 
genes, but are hkewise applicable to structually, e.g., sequence, related 
embodiments from other sources or mammalian species, including 
polymorphic or individual variants. These will include, e.g., proteins 

30 which exhibit a relatively few changes in sequence, e.g,, less than about 

5%, and number, e.g., less than 20 residue substitutions, typically less than 
15, preferably less than 10, and more preferably less than 5 substitutions, 
including 4, 3, 2, or 1. These will also include versions which are truncated 
from full length, as described, and fusion proteins containing substantial 

3 5 segments of these sequences. 

n. Definitions 
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The term "binding composition" refers to molecules that bind with 
specificity to a fliese DC proteins, e.g., in an antibody-antigen interaction. 
Other compounds, e.g., proteins, can also specifically associate with the 
respective protein. Typically, the specific association will be in a natural 
physiologically relevant protein-protein interaction, either covalent or noil- 
covalent; and may include members of a multiprotein complex, including 
carrier compounds or dimerization partners. The molecule may be a 
polymer, or chemical reagent A functional analog may be a protein with 
structural modifications, or may be a wholly unrelated molecule, e.g., 
which has a molecxilar shape which interacts with the appropriate 
interacting determinants. The variants may serve as agonists or 
antagonists of the protein, see, e.g., Goodman, et al. (eds.) (1990) Goodman 
& Gilman^ s: The Pharmacological Bases of Therapeutics (8th ed.) 
Pergamon Press, Tanytown, N.Y. 

The term "binding agentDC protein complex", as used herein, refers 
to a complex of a binding agent and DC protein. Specific binding of ttie 
binding agent means that the binding agent has a specific binding site that 
recognizes a site on the respective DC protein. For example, antibodies 
raised to the DC protein and recognizing an epitope on the DC protein are 
capable of forming an antibodyiDC protein complex by specific binding. 
Typically, the formation of a binding agentDC protein complex allows the 
measurement of that DC protein in a mixture of other proteins and 
biologies. The term "antibodyiDC protein complex" refers to a binding 
agentDC protein complex in which the binding agent is an antibody. The 
antibody may be monoclonal, polyclonal or even an antigen binding 
fragment of an antibody, e.g., including Fv, Fab, or Fab2 fragments. 

"Homologous" nucleic acid sequences, when compared, exhibit 
sigTuficant similarity. The standards for homology in nucleic acids are 
either measures for homology generally used in the art by sequence 
comparison and/or phylogenetic relationship, or based upon hybridization 
conditions. Both algorithms for sequence comparison and hybridization 
conditions are described in greater detail below. 

An "isolated" nucleic acid is a nucleic acid, e.g., an RNA, DNA, or a 
mixed polymer, which is substantially separated from other components 
which naturally accompany it, e.g., proteins and flanking genomic 
sequences from the originating species. The term embraces a nucleic acid 
sequence which has been removed from its naturally occurring 
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environment and includes reepmbina or cloned DNA isolates and 
chemically synthesized ariailbgs or analogs biologically synthesized by 
heterologous systems. A substantially pure molecule includes isolated 
forms of the molecule. An isolated nucleic add will generally be a 
homogeneous composition of molecules, but will, in some embodiments; 
contain minor heterogeneity. This heterogeneity is typically found at the 
polymer ends or portions npt critical to a desired biological function or 
activity. 

As used herein, the term "DCMPl protein" shall encompass, when 
used in a protein context, a protein having amino acid sequences as shown 
in SEQ ID NO: 2 or 8, or a significant fragment of such a protein. It refers 
to a polypeptide which interacts with the respective DCMPl protein 
specific binding components, These binding components, e.g., antibodies, 
typically bind to the DCMPl protein witii high affinity, e.g., at least about 
100 nM, usually better than about 30 nM, preferably better than about 10 
nM, and more preferably at better than about 3 nM. 

The term "DCMP2 forms" refers to the sequences provided in SEQ 
ID NO: 4 and 10. The nucleotide and corresponding amino acid squence of 
primate, e.g., human, protein related to lectin/ asialoglycoprotein family 
members, designated DCMP2, isolated from a dendritic cell library are 
provided in SEQ ID NO: 3 and 4. The long form is as shown, while the 
short form lacks the sequence corresponding to residues 118-144. The short 
form may also differ at nucleotide 1064. This is related to a monocyte form 
of an ASGPR, differing by an insertion between residues 173 and 174, and 
at residue 270, see Table 1, and insert of sequence encoding GEE between 
nucleotides 775-776. Another variant form is described in SEQ ID NO; 9 
and 10. 

The term "polypeptide" or "protein" as used herein includes a 
significant fragment or segment of said protein, and encompasses a stretch 
of amino acid residues of at least about 8 amino acids, generally at least 10 
amino acids, more generally at least 12 amino acids, often at least 14 amino 
acids, more often at least 16 amino acids, typically at least 18 amino acids, 
more typically at least 20 amino acids, usually at least 22 amino acids, more 
usually at least 24 amino acids, preferably at least 26 amino acids,more 
preferably at least 28 amino acids, and, in particularly preferred 
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embodiments, at least about 30 of more amino acids, e.gr, 35, 40/ 45> 50, 60, 
7d>etc. 

Prefeired embodiments . exhibit a plurality: of distinct; e.g., 
nonoverlapping, segments of the specified lengtii. Typically, the plurality 
5 will be at least two, more usually at least three, and preferably 5, 7, or even 
m^ore. While the length minima are provided, longer lengths, of various 
sizes, may be appropriate, e.g., one of length 7, and two of length 12. 
Segments may refer to either peptides or oligonucleotides. 

A "recombinant" nucleic acid is typically defined by its structure. It 
iO can be a nucleic acid made by generating a sequence comprising fusion of 
two ^agments which are not naturally contiguous to each ottier, but is 
meant to exclude products of nature, e.g., naturally occurring mutant 
forms. 

Certain forms are defined by a method of production. In reference 

15 to such, e.g., a product made by a process, the process is use of 

recombinant nucleic acid techniques, e.g., involving htmian intervention in 
the nucleotide sequence, typically selection or production. 

Thus, the invention encompasses, for example, nucleic acids 
comprising sequence derived using a synthetic oligonucleotide process, 

20 and products made by transforming cells with a non-naturally occurring 
vector which encodes these proteins. Such is often done to replace a codon 
with a redundant codon encoding the same or a conservative amino acid, 
while typically introducing or removing a sequence recognition site, e.g., 
for a restriction enzyme. Alternatively, it is performed to join together 

25 nucleic acid segments of desired functions to generate a single genetic 
entity comprising a desired combination of functions not found in the 
commonly available natural forms. Restriction en2yme recognition sites 
are often the target of such artificial manipulations, but other site specific 
targets, e.g., promoters, DNA replication sites, regulation sequences, 

30 control sequences, or other useful features, e.g., primer segments, may be 
incorporated by design. A similar concept is intended for a recombinant, 
e.g., fusion, polypeptide. Specifically included are synthetic nucleic acids 
which, by genetic code redundancy, encode polypeptides similar to 
fragments of these antigens, and fusions of sequences from various 

3 5 different species variants. 

"Solubility" is reflected by sedimentation measured in Svedberg 
units, which are a measxu^ of the sedimentation velocity of a molecule 
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iiilldCT conditions. The detenn^ of the sedimentation 

velocity was classically perfonned in an ancdyticil uItFacentrifiage> but is 
typicailly now perform^ m a steindard ultraceiitnfuge. See> Freifelder 
(1982]i Physical Biochemistry r2d ed ) W h: Freeman & Go.> San Frandsco,, 
5 CA; and Gantoi; and Schimmel (19801 Biophysical Chemistry parts lr3, 
W.H. Freieman & Co., San Frahcfecd> CA. As a crude defermmatipn, a 
sample containing a putatively soluble polypeptide is spun in a standard 
full sized ultracentrifuge at about 50K rpm for about 10 minutes, and 
soluble molecules will remain in the supernatant. A soluble particle or 

10 polypeptide will typically be less than about 303, more typically less than 
about 15S, usually less than about lOS^ more usually less than about 6S, 
and/in partictdar embodiments, preferably less than about 4S, and more 
preferably less thai! about 3S, Solubility of a polypeptide or fragment 
depends upon the enyironment and the polypeptide. Many parameters 

15 affect polypeptide sdlubilify, includihg temperature, electrolyiB 

environment, size and molecular characteristics of the pol3^eptide, and 
nature of the solvent Typically, the temperature at which the polypeptide 
is used ranges from about 4_ C to about 65_ C Usually the temperature at 
use is greater than about 18_ C and more usually greater than about 22_ C 

'20 For diagnostic purposes, the temperature will usually be about room 
temperature or warmer, but less than the denaturation temperature of 
components in the assay. For therapeutic purposes, the temperature will 
usually be body temperature, typically about 37_ C for humans, though 
under certain situations the temperature may be raised or lowered in situ 

25 or in vitro. 

The size and structure of the polypeptide should generally be in a 
substantially stable physiologically active state, and usually not in a 
denatured state. The polypeptide may be associated with other 
polypeptides in a quaternary structure, e.g., to confer solubility, or 

30 associated with lipids or detergents in a manner which approximates 
natural lipid bilayer interactions. 

The solvent will usually be a biologically compatible buffer, of a 
type used for preservation of biological activities, and will usually 
approximate a physiological solvent Usually the solvent will have a 

35 neutral pH, typically between about 5 and 10, and preferably about 7.5. 
On some occasions, a detergent wOl be added, typically a mild non- 
denaturing one, e.g., e.g., CHS (cholesteryl hemisuccinate) or CHAPS (3- 
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([3K:hplamid6pr6pyl]dimethyi-ammoni^ sulfonate)^ or in a low • 

enough detergent ^concentration as to avoid significant disruption of 
structural or physiological properties of the protein. 

"Substantially pure" typically means that the protein is isolated from 
5 other contaminating prdteins, nucleic acids, or other biolbgicals derived 
from the original source organism. Purity/or "isolation", may be assayed 
by standard methods, typically by weight, and will ordinarily be at least 
about 50% pure, more ordinarily at least about 60% pure, generally at least 
about 70% pure, more generally at least about 80% pure, often at least 
10 about 85% pure, more often at least about 90% pure, preferably at least 
about 95% pure, more preferably at least about 98% pure, and in most 
preferred embodiments, at least 99% pure. Carriers or excipierits will often 
be added, or the formulation may be sterile or comprise buffer 
components. 

15 "Substantial similarity" in the nucleic acid sequence comparison 

context means either that the segments, or ttieir complementary strands, 
when compared, are identical when optimally aligned, with appropriate 
nucleotide insertions or deletions, in at least about 50% of the nucleotides, 
generally at least 56%, more generally at least 59%, ordinarily at least 62%, 

20 more ordinarily at least 65%, often at least 68%, more often at least 71 %, 
typically at least 74%, more typically at least 77%, usually at least 80%, 
more usually at least about 85%, preferably at least about 90%, more 
preferably at least about 95 to 98% or more, and in particular 
embodiments, as high at about 99% or more of the nucleotides. 

2 5 Alternatively, substantial similarity exists when the segments will 

hybridize under selective hybridization conditions, to a strand, or its 
complement, typically using a sequence derived from SEQ ID NO: 1 or 7, 
or appropriate parts of 3 and 9. Typically, selective hybridization will 
occur when there is at least about 55% similarity over a stretch of at least 

3 0 about 30 nucleotides, preferably at least about 65% over a stretch of at least 

about 25 nucleotides, more preferably at least about 75%, and most 
preferably at least about 90% over about 20 nucleotides. See, Kanehisa 
(1984) Nuc. Acids Res. 12:203-213. The length of similarity comparison, as 
described, may be over longer stretches, and in certain embodiments will 
35 be over a stretch of at least about 17 nucleotides, usually at least about 20 
nucleotides, more usually at least about 24 nucleotides, typically at least 
about 28 nucleotides, more typically at least about 40 nucleotides. 
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preferably at least about 50' nucleotides^ and more preferably at least about • 
75 to 100 or more nucleoiddies. The measttres of comparison for the DCMPl 
do not reflect on those comparison measiures for the i>C!MP2 eml50diments. 
"Stringent Conditions' in refiemhg to homology or substantial 
5 similarity in the hybridization context^ will be stringent combined 

conditions of salt temperature, organic solvents, and other parameters, 
typically those controlled in hybridization reactions. The combination of 
parameters is more important than the measure of any single parameter. 
See, e.g., Wetmur and Davidson (1968) LMoLBioL 31:349-370. A nucleic 

10 acid probe which binds to a target nucleic acid under stringent conditions 
is specific for said target nucleic acid.: Such a probe is typically more than 
11 nucleotides in length, and is sufficiently identical or complementary to a 
target nucleic acid over the region specified by the sequence of the probe to 
bind the target uindef stringent hybridization conditions. 

15 Counterpart DCMP proteins ^om other mammalian species can be 

cloned and isolated by cross-species hybridization of closely related 
species. See, e.g., below. Similarity may be relatively low between 
distantly related species, and thus hybridization of relatively closely 
related species is advisable. Alternatively, preparation of an antibody 

20 preparation which exhibits less species specificity may be useful in 
expression cloning approaches. 

The phrase "specifically binds to an antibody" or "specifically 
immunoreactive with", when referring to a protein or peptide, refers to a 
binding reaction which is determinative of the presence of the protein in 

25 the presence of a heterogeneous population of proteins and other biological 
components. Thus, under designated immimoassay conditions, the 
specified antibodies bind to a particular protein and do not significantly 
bind other proteins present in the sample. Specific binding to an antibody 
xmder such conditions may require an antibody that is selected for its 

30 specificity for a particular protein. For example, antibodies raised to the 
human DCMPl protein immunogen with the amino acid sequence 
depicted in SEQ ID NO: 2 or 8 can be selected to obtain antibodies 
specifically immunoreactive with that DCMP protein and not with other 
proteins. These antibodies recognize proteins highly similar to the 

35 homologous huiman DCMPl protein. 

HL Nucleic Acids 
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These DCMP genes are selectively exjpfessed on dendritic cells. The 
preferred embodiinent5> as disclosed^ will be iiseM in stan 
to isolate genes from other species^ e.g., warm blooded animals> such as 
birds and mammals. Cross hybridization will allow isolation of related 
5 proteins from individuals, strains, or species. A number of different 

approaches are available successfully to isolate a suitable nucleic acid clone 
based upon the information provided herein. Southern blot hybridization 
studies should identify homologous genes in other species under 
appropriate hybridization conditions. 

10 Purified protein or defined peptides are useful for generating 

antibodies by standard methods, as described below. Synthetic peptides or 
purified protein can be presented to an immune system tp generate 
polyclonal and mionoclonal antibodies. See, e.g;, Coligan (1991) Current 
Protocols i n Immunologv Wiley/Greene, NY; and Harlow andLane (1989) 

15 Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY, which 
are incorporated herein by reference. Alternatively, a DCMP antigen 
binding composition can be useful as a specific binding reagent, and 
advantage can be taken of its specificity of binding, for, e.g., purification of 
a DCMP protein. 

20 The specific binding composition can be used for screening an 

expression library made from a cell line which expresses the respective 
DCMP protein. Many methods for screening are available, e.g., standard 
staining of surface expressed ligand, or by panning. Screening of 
intracellular expression can also be performed by various staining or 

25 immunofluorescence procedures. The binding compositions could be 
used to affinity purify or sort out cells expressing the antigen. 

Table 1: Alignment of primate, e.g., human, lectin/ASGPR family 
members. ASGPRhl and ASGPRh2 are hepatic aeialoglycoprotein receptors 
30 (see SEQ ID NO: 5 and S) ; ASGPRra is a macrophage derived ASGPR ; DCMP2 
has short, long, and a variant forms, SBQ ID NO: 4 and 10; DCMPl is 
presented in SEQ ID NO: 2 and 8) . 

ASGPRhl MTTKE . , YQDLQHLDNEESDHHQLRKGPPPPQPLLQRLCSGP RLLLLSLG 

3 5 ASGPRh2 MAKD . . FQDIQQLSSEENDHP . FHQGPPPAQPLAQRLCSMV CFSLLALS 

ASGPRm MtRT . . YENFQYLENKVKVQG . FKNGPLPLQSLLQRLRSGP CHLLLSLG 

DCMP2 s MTRT . . YENFQlfLENKVKVQG . FKNGPLPLQSLLQRLRSGP CHLLLSLG 

DCMP2 1 MTRT . . YENFQYLENKVKVQG . FKNGPLPLQSLLQRLRSGP CHLLLSLG 

DCMP2 V OTRT . . YENFQYLENKVKVQG . FKNGPLPLQS 

4 0 DCMPl MTSEITYAEVR FKNEFKSSGINTASSAASKERTAPHKSNTGFPKLLCASLLIFF 

feature **** +++++ 

ASGPRhl LSLLLLWVCVIGS . QNSQIflEELRGLRETFSNFTASTEAQVKGLSTQGGNVGRKMKSLESQLE . 
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AS6PIUi2 
ASGPRm 
DCMP2S 
DCMP2i 
DCMP2V 
DCMPl 

feature 



FmiJLjiVWCVTGS . QSAQLQAEUISIJCBAFSNFSSSTLTEVQAISTHGGS^ . 
LGIJaLLVIIGWG , FQNSKFQraLVTLRTD^ 

liGLLLLVIICWG . FQNSKFQRDLVTLRTDFSNFTSm'VMIQi^TSQGSSI^ 
U3LLLLVIICWG . FQNSKFQM3LVTLRTDFSNFTSNTW^IQALTS^^^ 
. . LLLLVIICVVG . FQNSKFQHDLVTlJlTDFSNFTSimrftEIQALTSQGSSLEETIASI^^ 
LLLAISFFIAFVIFFQKyS . Q . . U:£KKrr . KELVHTn.E . . . . Cn/IOCNMPVBEtAWS .V 



++++++++ 



10 



15 



^^D^^ * DLSEDHSSIJiLHVKQFVSDLRSLSCOMAALQGN^^ 

ASGPRh2 .KQQQ, , DLKADHDAIJiFHLKHFPVDIJiFVACQIffiLL^ 

ASGPRin FKQERQA VHSEMLLRVQQLVQDLKKLTCQVATLNNNGE 

DCMP2S FKQERQA. VHSEMLLRVQQLVQDLKKLTCQVATLNNN . 

DCMP21 PKQERQAGVSELQEHIT^KAHIXSHCPHCPSVCVPVHSEMI^VQQLVQDIJ^ 

Daff>2 V FKQERQA VHSEMLLRVQQLVQDIJ^ 

DCMPl 

feature r • • • • • • • . ....... 



20 



25 



ASGPRhl ER TCCPVNWVEHERSCYWFSRSGKAWADADNYCRIiBDAHLVVVTS WE 

ASGPRh2 QR TGCPVNWVEHQGSCYWFSHSGKAWAEAEKYCQIJSNAHLVVINSWEEQKFIVQHl^ 

ASGPRin KASTEGTCCPVNWVEHQDSCYWFSHSGMSWAEAEKyCCLKNAH 

DCMP2S .ASTEGTCCPVim^HQDSCTWFSHSGMSWAEABKyCQWCN^ 

DCMP21 .ASTEGTCCPVNWVEHQDSCYWFSHSGMSWAEAEKYCQLKNAHLVVINS^^^ 

DCMP2V EyVSTEGTCCPVNWVEHQDSCYWFSHSGMSWAEAEKYCQLKNAHL 

CXTPKNWKSFSSNCYFISTESASWQDSBmOUlMEAHLLVIOTQEEQDFIFQ^^ 



DCMPl 
feature 



30 



35 



40 



45 



50 



ASGPRhl 
ASGPRh2 
ASGPRm 
DCMP2B 
DCMP21 
DCMP2V 
DCMPl 
feature 

ASGPRhl 
ASGPRh2 
ASGPRm 
DCMP26 
DCMP21 
DCMP2V 
DCMPl 
feature 



W.MGLHDQNGP. . WKWVDGTDYETGFKNWRPEQPDDWYGHGLGGGEDCA . .HFTDDGR. . ,WNDD 
W.IGIiTDSDGS. .WKWVDGTDYRHNYKNWAVTQPDNWHGHELGGSEDCV. .EVQPDGR. . , WNDD 
W.MGLSDPEGA. . WKWVDGTDYATGFQNWKPGQPDDWQGHGLGGGEDCA. .HFHPDGR. , .WNDD 
W.MGLSDPEGA. . WKWVDGTDYATGFQNWKPGQPDNWQGHGLGGGEDCA . .HFHPDGR. . .WNDD 
W.MGLSDPEGA. . WKWVDGTDYATGFQNWKPGQPDDWQGHGLGGGEDCA. .HFHPDGR. . .WNDD 
W.mLSDPEGA. .WKWVDGTDYATGFQNWKPGQPDDWQGHGLGGGEDCA. .HFHPDGR. . .WNDD 

YFVGLSDPEGQRHWQWVDQTPYNESSTFWHPREPSD PNERCWLNFRKSPKRWGWNDV 

XXX 

VCQRPYRWVCETELDKASQEPPLOj 
FCLQVYRWVCEKRRNATGE. . .VA 
VCQRPYHWVCEAGLGQTSQESH 
VCQRPYHWVCEAGLGQTSQESH 
VCQRPYHWVCEAGLGQTSQESH 
VCQRP YHWVCEAGLGQTSQE SH 
NCLGPQRS VCEMMKIH L 



features: internalization domain (an extended domain EITYAEV is 

seen in the NK receptor NKA) ; +++ transmembrane domain; ... C-type 
lectin domain; XXX sugar specificity domain. The DCMPl receptor is 
closest in homology to the macrophage lectin in the lectin domain. 

Sequence analysis suggests these DCMPs are members of the 
lectin/ asialoglycoprotein superfamily of receptors. The peptide segments 
can also be used to design and produce appropriate oligonucleotides to 
screen a library to determine the presence of a similar gene, e.g., an 
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idehtical or polymorphic variant, or to identify a DC The genetic code can 
be used to select appropriate oligonucleotides useful as probes for 
scieening. In combination with polymerase chain reaction (PGR) 
techniques/ synthetic oligonucleotides will be useful in selecting desired 
5 clones from a library. 

Complementary sequences will also be used as probes or primers. 
Based upon identification of the likely amino terminus, other peptides 
should be particularly useful, e.g., coupled with anchored vector or p6ly-A 
complementary PCR techniques or with complementary DNA of other 

10 peptides. 

Techniques for nucleic acid manipulation of genes encoding these 
DC proteins, e.g., siibcloning nucleic acid sequences encoding polypeptides 
into expression vectors, labeling probes, DNA hybridization, and the like 
are described generally in Sambrook, et al. (1989) Molecular Cloning - A 

15 Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, 

Cold Spring Harbor Press, NY, which is incorporated herein by reference 
and hereinafter referred to as "Sambrook, et al." See also, Coligan, et al. 
(1987 and periodic supplements) Current Protocols in Molecular Bioloygy 
Greene/Wiley, New York, NY, referred to as "Coligan, et al" 

20 There are various methods of isolating the DNA sequences encoding 

these DC proteins. For example, DNA is isolated from a genomic or cDNA 
library using labeled oligonucleotide probes haAdng sequences identical or 
complementary to the sequences disclosed herein. Full-length probes may 
be used, or oligonucleotide probes may be generated by comparison of the 

25 sequences disclosed with other proteins and selecting specific primers. 

Such probes can be used directly in hybridization assays to isolate DNA 
encoding DC proteins, or probes can be designed for use in amplification 
techniques such as PCR, for the isolation of DNA encoding DC proteins. 
To prepare a cDNA library, mRNA is isolated from cells which 

30 express the DC protein. cDNA is prepared from the mRNA and ligated 
into a recombinant vector. The vector is transfected into a recombinant 
host for propagation, screening and cloning. Methods for making and 
screening cDNA libraries are well known. See Gubler and Hoffman (1983) 
Gene 25:263-269; Sambrook, et al.; or Coligan, et al. 

3 5 For a genomic library, the DNA can be extracted from tissue and 

either mechanically sheared or enTymatically digested to yield fragments 
of about 12-20 kb. The fragments are then separated by gradient 
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centHfugatipn and cloned in bacteriophage lambda vectors. These yeetors 
and phage are packaged in vitro, as described, e.g., in Sambrobl^ et ail. or 
Cplig^n; et al. Recombinant phage are analyzed by plaque hybndizatipn as 
described in Benton and Davis (1977) Science 196:180-182. Colony 
5 hybridization is carried, out as generally described in> e.g., Grunstein> et al. 
(1975) Proc. Natl. Acad. Sci. USA 72:3961-3965. 

DNA encoding a PC protein can be identified in either cDNA or 
genomic libraries by its ability to hybridize with the nucleic acid probes 
described herein, for example in colony or plaque hybridization 

10 experiments. The corresponding DNA regions are isolated by standard 
methods familiar to those of skiU in the art. See Sambrook, et al. 

Various methods of amplifying target sequences, such as the 
polymerase chain reaction, can also be used to prepare DNA encoding DC 
proteins. Polymerase chain reaction (PCR) technology is used to amplify 

15 such nucleic acid sequences directiy from mRNA, from cDNA, and from 
genomic libraries or cDNA libraries. The isolated sequences encoding DC 
proteins may also be used as templates for PCR amplification. 

In PCR techniques, oligonucleotide primers complementary to two 5' 
regions in the DNA region to be ampHfied are synthesized. The 

20 polymerase chain reaction is then carried out using the two primers. See 
Innis, et al. (eds.) (1990) PCR Protocols: A Guide to Methods and 
Applications Academic Press, San Diego, CA. Primers can be selected to 
amplify the entire regions encoding a selected full-length DC protein or to 
amplify smaller DNA segments as desired. Once such regions are PCR- 

25 amplified, they can be sequenced and oligonucleotide prol>es can be 

prepared from sequence obtained using standard techniques. These probes 
can then be used to isolate DNAs encoding other forms of the DC proteins. 

Oligonucleotides for use as probes are chemically synthesized 
according to the solid phase phosphoramidite triester method first 

30 described by Beaucage and Carruthers (1983) Tetrahedron Lett 

22(20): 1859-1 862, or using an automated synthesizer, as described in 
Needham-VanDevanter, et al. (1984) Nucleic Acids Res. 12:6159-6168. 
Purification of oligonucleotides is performed e.g., by native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in Pearson and 

3 5 Regnier (1983) I Chrom. 255:137-149. The sequence of the synthetic 

oligonucleotide can be verified using the chemical degradation method of 
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Mkxam^ and Gflbert in Grossman and Moldave (feds. 1980) Miethods in 
Eng^iologji:6^ New Ywk. 

This invention provides isolated DNA or fragments to encode a DC 
protein, as described, In .addition, this invention provides isolated or 

5 recombinant DNA which encodes a biologically active protein or 

polypeptide which is capable of hybridizing under appropriate conditions, 
e.g., high stringency, with the DNA sequences described heriein. Said 
biologically active protein or polypeptide can be a nahirally occurring 
form, or a recombinant protein or fragment, and have an amino acid 

0 sequence as disclosed in SEQ ID NO: 2, 4, 8, or 10. Preferred embodiments 
will be full length natural isolates, e.g., from a primate. In glycosylated 
form, the proteins should exhibit larger sizes. Further> this invention 
encompasses the use of isolated or recombinant DNA, or fragments thereof, 
which encode proteins which are homologous to each respective DG 

5 protein. The isolated DNA can have the respective regulatory sequences in 
the 5' and 3' flanks, e.g., promoters, enhancers, poly-A addition signals, 
and others. 



IV. Making DC Gene Products 

2 0 DNAs which encode these DC proteins or fragments thereof can be 

obtained by chemical synthesis, screening cDNA libraries, or by screening 
genomic libraries prepared from a wide variety of cell lines or tissue 
samples. 

These DNAs can be expressed in a wide variety of host cells for the 
25 synthesis of a full-length protein or fragments which can, e.g., be used to 
generate polyclonal or monoclonal antibodies; for binding studies; for 
construction and expression of modified molecules; and for 
structure/ function studies. Each of these DC proteins or their fragments 
can be expressed in host cells that are transformed or transfected with 

3 0 appropriate expression vectors. These molecules can be substantially 

purified to be free of protein or cellular contaminants, other than those 
derived from the recombinant host, and therefore are particularly useful in 
pharmaceutical compositions when combined with a pharmaceutically 
acceptable carrier and/ or diluent. The antigen, or portions thereof, may be 
35 expressed as fusions with other proteins. 

Expression vectors are typically self-replicating DNA or RNA 
constructs containing the desired DC gene or its fragments, usually 



SUBSTITUTE SHEET (RULE 26) 



Wp 99/02562 PCTAJS98/13436 

17 ^ 



pperably lii^^ genetic control elements that are recognized in a - 

suitable host cell. These control elements are capable of . effecting 
expression within a suitable host This specific type of control elements 
necesisary to effect expression will depend upon the eventual host cell used. 
5 Generally, the genetic control elements can include a prokaryptic promotei: 
system or a eukary otic promoter expression control system, and typically 
include a transcriptional promoter, an optional operator to control the onset 
of transcription, transcription enhancers to elevate the level of mRNA 
expression^ a sequence that encodes a suitable ribosome binding site, and 

i b sequenees that terminate transcription and translation- Expression vectors 
also usually contain an origin of replication that allows the vector to 
replicate independentiy from the host cell. 

The Vectons of this invention contain DNAs which encode the 
various DC proteins, or a fragment tiiereof, typically encoding, e.g., a 

15 biologically active polypeptide, or protein. The DNA can be under the 
control of a viral promoter and can encode a selection marker. This 
invention further contemplates use of such expression vectors which are 
capable of expressing eukaryotic cDNA coding for a DC protein in a 
prokaryotic or eukaiyotic host, where the vector is compatible with the host 

20 and where the eukaryotic cDNA coding for the protein is inserted into the 
vector such that growth of the host containing the vector expresses the 
cDNA in question. Usually, expression vectors are designed for stable 
replication in their host cells or for amplification to greafly increase the 
total number of copies of the desirable gene per cell. It is not always 

25 necessary to require that an expression vector replicate in a host cell, e.g., it 
is possible to effect transient expression of the protein or its fragments in 
various hosts using vectors that do not contain a replication origin that is 
recognized by the host cell. It is also possible to use vectors that cause 
integration of a DC gene or its fragments into the host DNA by 

3 0 recombination, or to integrate a promoter which controls expression of an 
endogenous gene. 

Vectors, as used herein, comprise plasmids, viruses, bacteriophage, 
integratable DNA fragments, and other vehicles which enable the 
integration of DNA fragments into the genome of the host. Expression 

35 vectors are specialized vectors which contain genetic control elements that 
effect expression of operably linked genes. Plasmids are the most 
commonly used form of vector but all other forms of vectors which serve 
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an equivalent function are suitable for use herein, See, e,g., P(puwels> et aL 
(1985 and SupplementsV Cloning Vectors: A t^bbrafonr Mainual Elsevier, 
N:Y.; and Rodriquez> et al. (eds.) (1988) Vectors: A Survev of Molecular 
gpning Vectors aiid Their Uses Buttersworth, Boston> MA. 
5 Suitable host cells include prokaiyotes, lower eukaFy:otes> and 

higher eukaryotes, Prokaryotes include both gram n^ative and gram 
positive organisms, e.g./ E. coli and B. subtilis. Lower eukaryot^ include 
yeasts, e.g., S. cerevisiae and Pichia, and species of the genus 
Dictyostelium, Higher eukaryotes include established tissue culhxre cell 

10 lines from animal cells, both of non-mammalian origin, e.g., insect cells, 
and birds, and of mammaUan origin, e.g., human, primates, and rodents. 

Prokaryotic host-vector systems include a wide yariety of vectors for 
many different species. As used herein, E. coli and its vectors will be used 
generically to include equivalent vectors used in other prokaryotes. A 

15 representative vector for amplifying DNA is pBR322 or its derivatives. 

Vectors that can be used to express DC proteins or fragments include, but 
are not limited to, such vectors as those containing the lac promoter (pUC- 
series); trp promoter (pBR322-trp); Ipp promoter (the pIN-series); lambda- 
pP or pR promoters (pOTS); or hybrid promoters such as ptac (pDR540). 

20 See Brosius, et aL (1988) "Expression Vectors Employing Lambda-, trp-, lac- 
, and Ipp-derived Promoters", in Rodriguez and Denhardt (eds.) Vectors: A 
Survey of Molecular Cloning Vectors and Their Uses 10:205-236 
Buttersworth, Boston, MA. 

Lower eukaryotes, e.g., yeasts and Dictyostetium, may be 

25 transformed with DC gene sequence containing vectors. For purposes of 
this invention, the most common lower eukaryotic host is the baker's yeast, 
Saccharomyces cerevisiae. It will be used generically to represent lower 
eukaryotes although a number of other strains and species are also 
available. Yeast vectors typically consist of a replication origin (unless of 

30 the integrating type), a selection gene, a promoter, DNA encoding the 

desired protein or its fragments, and sequences for translation termination, 
polyadenylation, and transcription termination. Suitable expression 
vectors for yeast include such constitutive promoters as 3-phosphoglycerate 
kinase and various other glycolytic enzyme gene promoters or such 

3 5 inducible promoters as the alcohol dehydrogenase 2 promoter or 

metallothionine promoter. Suitable vectors include derivatives of the 
following types: self-replicating low copy number (such as the YRp-series), 
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self-replicating high copy nxmiber (such as the YEp-series); integrating 
types (such as the YIp-series)V or Tnini-clu-omosonies (such as the YGp- 
seiies). 

Higher eukaryotic tissue culture cells are the preferred host cells for 
5 expression of the DC protein. In principle, most any higher eukaryotic 
tissue culture cell line may be used, e.g., insect baculovirus expression 
systems, whether from an invertebrate or vertebrate source. However, 
mammalian cells are preferred to achieve proper processing, both 
cotranslationally and posttranslationally. Transformation or transfection 

10 and propagation of such cells is routine. Useful cell lines include HeLa 
cells, Chinese hamster ovary (CHO) cell lines, baby rat kidney (BRK) cell 
lines, insect cell lines, bird cell lines, and monkey (COS) cell lines. 
Expression vectors for such cell lines usually include an origin of 
replication, a promoter, a translation mitiation site, RNA splice sites (e.g., if 

15 genomic DNA is used), a polyadenylation site, and a transcription 
termination site. These vectors also may contain a selection gene or 
amplification gene. Suitable expression vectors may be plasmids, viruses, 
or retroviruses carrying promoters derived, e.g., from such sources as from 
adenovirus, SV40, parvoviruses, vaccinia virus, or cytomegalovirus. 

20 Representative examples of suitable expression vectors include pCDNAl; 

pCD, see Okayama, et al. (1985) MoL Cell Biol. 5:1136-1142; pMClneo Poly- 
A, see Thomas, et al. (1987) Cell 51:503-512; and a baculovirus vector such 
as pAC 373 or pAC 610. 

In certain instances, the DC proteins need not be glycosylated to 

25 elicit biological responses in certain assays. However, it will often be 
desirable to express a DC polypeptide in a system which provides a 
specific or defined glycosylation pattern. In this case, the usual pattern will 
be that provided naturally by the expression system. However, the pattern 
will be modifiable by exposing the polypeptide, e.g., in unglycosylated 

30 form, to appropriate glycosylating proteins introduced into a heterologous 
expression system. For example, a DC gene may be co-transformed with 
one or more genes encoding mammalian or other glycosylating enzymes. 
It is further imderstood that over glycosylation may be detrimental to DC 
protein biological activity, and that one of skill may perform routine testing 

35 to optimize the degree of glycosylation which confers optimal biological 
activity. 
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A DC protein, or a fragment thereof, may be engineered to be 
phosphatidyl inositol (PI) linked ,tx> a cell membrane, but can be removed 
from membranes by treatment with a phosphatidyl inositol cleaving 
enzyme, e.g., phosphatidyl inositol phospholipase-C; This releases the 
5 antigen in a biologically active form/ and allows purification by standard 
procedures of protein chemistry. See, e.g.. Low (1989) Biochem. Biophys. 
Acta 988:427-454; Tse, et al. (1985) Science 230:1003-1008; Brunner, et al. 
(1991) T. CeU Biol. 114:1275-1283; and Coligan, et al. (eds.) (1996 and 
periodic supplements) Current Protocols in Protein Science. John Wiley & 

10 Sons, New York, NY. 

Now that these DC proteins have been characterized, fragments or 
derivatives thereof can be prepared by conventional processes for 
synthesizing peptides. These include processes such as are described in 
Stewart and Young (1984) Solid Phase Peptide Synthesis Pierce Chemical 

15 Co., Rockford, IL; Bodanszky and Bodanszky (1984) The Practice of Peptide 
Synthesis Springer-Verlag, New York, NY; and Bodanszky (1984) The 
Principles of Peptide Synthesis Springer-Verlag, New York, NY. See also 
Merrified (1986) Science 232:341-347; and Dawson, et al. (1994) Science 
266:776-779. For example, an azide process, an acid chloride process, an 

20 acid anhydride process, a mixed anhydride process, an active ester process 
(for example, p-nitrophenyl ester, N-hydroxysuccinimide ester, or 
cyanomethyl ester), a carbodiimidazole process, an oxidative-reductive 
process, or a dicyclohexylcarbodiimide (DCCD)/ additive process can be 
used. Solid phase and solution phase syntheses are both applicable to the 

25 foregoing processes. 

The prepared protein and fragments thereof can be isolated and 
purified from the reaction mixture by means of peptide separation, for 
example, by extraction, precipitation, electrophoresis and various forms of 
chromatography, and the like. The DC proteins of this invention can be 

30 obtained in varying degrees of purity depending upon the desired use. 
Purification can be accomplished by use of known protein purification 
techniques or by the use of the antibodies or binding partners herein 
described, e.g., in immunoabsorbant affinity chromatography. This 
immunoabsorbant affinity chromatography is carried out by first linking 

35 the antibodies to a soUd support and contacting the linked antibodies with 
solubilized lysates of appropriate source cells, lysates of other cells 
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e^cpi^ssiitg the; protein^ dr lysates or sppemataitts of cells producing the 
proteins as a result of DNA techniques/ see' below. 

Miiltiple cell lines may be screened for one which expresses said 
protein at a high level compared With pther cells. Various cell lines, e,g.> a 
5 mouse thymic stromal cell line TA4, is screened and selected for its 

favorable handling properties, Natural ,DG cell proteins can be isolated 
from natural sources, or by expression from a transformed cell using an 
appropriate expression vector. Purification of the expressed protein is 
achieved by standard procedures, or may be combined with engineered 
10 means for effective purification at high effidehcy from cell lysates or 

supematants. FLAG or Hisg: segments can be used for such purification 

features. 

V, Antibodies 

15 Antibodies can be raised to the various DC proteins, including 

individual, polymorphic, allelic, strain, or species variants, and fragments 
thereof, both in their naturally occurring (full-length) forms and in their 
recombinant forms. Additionally, antibodies can be raised to DC proteins 
in either their active forms or in their inactive forms. Anti-idiotypic 

20 antibodies may also be used. 

a. Antibody Production 

A number of immunogens may be used to produce antibodies 
specifically reactive with these DC proteins. Recombinant protein is the 
preferred immunogen for the production of monoclonal or polyclonal 

25 antibodies. Naturally occurring protein may also be used either in pure or 
impure form. Synthetic peptides made using the human DC protein 
sequences described herein may also used as an immunogen for the 
production of antibodies to the DC protein. Recombinant protein can be 
expressed in eukaryotic or prokaryotic cells as described herein, and 

30 purified as described. The product is then injected into an animal capable 
of producing antibodies. Either monoclonal or polyclonal antibodies may 
be generated for subsequent use in immunoassays to measure the protein. 

Methods of producing polyclonal antibodies are known to those of 
skill in the art. In brief, an immunogen, preferably a purified protein, is 

35 mixed with an adjuvant and animals are immunized with the mixture. The 
animal's immune response to the immunogen preparation is monitored by 
taking test bleeds and determining the titer of reactivity to the DC protein 
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0f interest Whfen approjpriately high titers of antibody to the immunogen 
are obtained^ blood is collected from the animal and antisera are prepai^d. 
Further fractionation of the antisera to enrich for antibodies reactive to the 
protein can be done if desired. See, e.g., Harlow and Lane. 
5 Monoclonal antibodies may be obtained by various techniques 

familiar to those skilled in the art Briefly, spleen cells from an animal 
immunized with a desired antigen are immortalized, commonly by fusion 
with a myeloma celt See, e.g:, Kohler and MUstein (1976) Eur. T. ImmunoL 
6:511-519, which is incorporated herein by reference. Alternative methods 

10 of immortalization include transformation with Epstein Barr Virus, 

oncogenes, or retroviruses, or other methods known in the art Colonies 
arising from single immortalized cells are screened for production of 
antibodies of tiie desired specificity and affinity for the antigen, and yield 
of the monoclonal antibodies produced by such cells may be enhanced by 

15 various techniques, including injection into the peritoneal cavity of a 
vertebrate host Alternatively, one may isolate DNA sequences which 
encode a monoclonal antibody or a binding fragment thereof by screening 
a DNA library from human B cells according to the general protocol 
outiined by Huse, et at (1989) Science 246:1275-1281, 

20 Antibodies, including binding fragments and single chain versions, 

against predetermined fragments of these DC proteins can be raised by 
immunization of animals with conjugates of the fragments with carrier 
proteins as described above. Monoclonal antibodies are prepared from 
cells secreting the desired antibody. These antibodies can be screened for 

25 binding to normal or defective DC proteins, or screened for agonistic or 

antagonistic activity. These monoclonal antibodies will usually bind with 
at least a Kj) of about 1 mM, more usually at least about 300 |iM, typically 

at least about 10 jiM, more typically at least about 30 ^M, preferably at least 
about 10 ^M, and more preferably at least about 3 jxM or better. 

30 ^ some instances, it is desirable to prepare monoclonal antibodies 

from various mammalian hosts, such as mice, rodents, primates, humans, 
etc. Description of techniques for preparing such monoclonal antibodies 
may be found in, e.g., Stites, et at (eds.) Basic and Clinical Immunology 
(4th ed.) Lange Medical Publications, Los Altos, CA, and references cited 

3 5 therein; Hariow and Lane (1988) Antibodies: A Laboratory Manual CSH 
Press; Coding (1986) Monoclonal Antibodies: Principles and Practice (2d 
ed.) Academic Press, New York, NY; and particulariy in Kohler and 



SUBSTITUTE SHEET (RULE 26) 



wo 99/02562 



23 



PGTAJS98/I3436 



Milsteirt (197j5) N^bi]^:256A95A97, Vifhieh discusses orie method of 
generatmg monoclonal antibodies;^ Summarized briefly; this method 
involves injecting an animal with- an immtmogen to initiate a humoral 
immune response. The animal is then sacrificed and cells taken from its 
5 spleen. Which are then fused with myeloma cells. The result is a hybrid 

cell or "hybridoma" that is capable of reproducing in vitro. The population 
of hybridomas is then screened to isolate individual clones, each of which 
secretes a single antibody species to the immunogen. In this manner, the 
individual antibody species obtained are the products of immortalized and 

10' cloned single B cells from the immune animal generated in response to a 
specific site recognized on the immLunogenic substance. 

Other suitable techniques involve selection of libraries of antibodies 
in phage or similar vectors. See, Htise, et al. (1989) "Generation of a Large 
Combinatorial Library of the Immunoglobulin Repertoire in Phage 

is Lambda," Science 246:1275-1281; and Ward, et aL (1989) Nature 341:544- 

546. The polypeptides and antibodies of thepresent invention may be used 
with or without modification, including chimeric or humanized antibodies. 
Frequently, the polypeptides and antibodies will be labeled by joining, 
either covalentiy or non-covalently, a substance which provides for a 

2 0 detectable signal A wide variety of labels and conjugation techniques are 
known and are reported extensively in both the scientific and patent 
literature. Suitable labels include radionuclides, enzymes, substrates, 
cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, 
magnetic particles, and the like. Patents, teaching the use of such labels 

25 include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. Also, recombinant immunoglobulins may be 
produced. See, Cabilly, U.S. Patent No. 4,816,567; and Queen, et al. (1989) 
Proc. Nat'l Acad. Sci. USA 86:10029-10033. 

The antibodies of this invention can also be used for affinity 

30 chromatography in isolating each DC protein. Columns can be prepared 
where the antibodies are linked to a soHd support, e.g., particles, such as 
agarose, SEPHADEX, or the like, where a cell lysate may be passed 
through the column, the column washed, followed by increasing 
concentrations of a mild denaturant, whereby purified DC protein will be 

35 released. 

The antibodies may also be used to screen expression libraries for 
particular expression products. Usually the antibodies used in such a 
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proceduFe will be labeled with a moiefy allowing easy detection of 
presence of antigen by antibody binding* 

Antibodies to DC proteins may be used; for the analysis or, or 
identification of specific cell population components which express the 
5 respective protein. By assaying the expression products of cells expriessing 
DC proteins it is possible to diagnose disease^ e.g., immune-compromised 
conditions, DC depleted conditions, or overproduction of DC. 

Antibodies raised against each DC will also be useful to raise anti- 
idiotypic antibodies. These will be useful in detecting or diagnosing 
10 various immunological conditions related to expression of the respedivie. 
antigens. 

b. Immunoassays 

A particular protein can be measured by a variety of immtmoassay 
mediods. For a review of immunological and immunoassay procedures in 

15 general see Stites and Terr (eds.) 1991 Basic and Qinical Immunology (7th 
ed.). Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, which are reviewed extensively 
in Maggio (ed.) (1980) Enzyme Immunoassay CRC Press,Boca Raton, 
Florida; Tijan (1985) "Practice and Theory of Enzyme Immunoassays," 

20 Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier 
Science Publishers B.V., Amsterdam; and Harlow and Lane Antibodies. A 
Laboratory Manual supra, each of which is incorporated herein by 
reference. See also Chan (ed.) (1987) Immunoassay: A Practical Guide 
Academic Press, Orlando, FL; Price and Newman (eds,) (1991) Principles 

25 and Practice of Immunoassays Stockton Press, NY; and Ngo (ed.) (1988) 
Non-isotopic Immimoassays Plenum Press, NY. 

Immunoassays for measurement of these DC proteins can be 
performed by a variety of methods known to those skilled in the art In 
brief, immunoassays to measure the protein can be competitive or 

30 noncompetitive binding assays. In competitive binding assays, the sample 
to be analyzed competes with a labeled anaiyte for specific binding sites on 
a capture agent bound to a solid surface. Preferably the capture agent is an 
antibody specifically reactive with the DC protein produced as described 
above. The concentration of labeled anaiyte bound to the capture agent is 

3 5 inversely proportional to the amount of free anaiyte present in the sample. 

In a competitive binding immunoassay, the DC protein present in 
the sample competes with labeled protein for binding to a specific binding 
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ajgeitt^ for example, an antibody specifically reaiqtive with the DG prptein> 
The binding agent may be bound to a solid surface to<etfect separation of 
bdiind labeled protein from tiie unbound labelled protein. Alternately, the 
competitive binding assay may be conducted in liquid phase and any of a 
5 variety of techniques known in the art may be used to separate the bound 
labeled protein from the unbound labeled protein. FoUowinjg separation, 
the amotmt of bound labeled protein is determined. The amount of protein 
present in the sample is inversely proportional to tite amount of labeled 
protein binding. 

10 Alternatively, a homogeneous immunoassay may be performed in 

which a separation step is not needed. In ihese^ immuiioassays, the. label on 
Oie protein is altered by the binding of the protein to its specific binding 
agent This alteration in the labelled protein r^ults in a decrease or 
increase in the signal emitted by label, so that measurement of the label at 

15 the end of the immunoassay allows for detection or quantitation of the 
protein. 

These DC proteins may also be quantitatively determined by a 
variety of noncompetitive immimoassay methods. For example, a two-site, 
solid phase sandwich immunoassay may be used. In this type of assay, a 

2 0 binding agent for the protein, for example an antibody, is attached to a 
solid support A second protein binding agent, which may also be an 
antibody, and which binds the protein at a different site, is labeled. After 
binding at botii sites on the protein has occurred, the unbound labeled 
binding agent is removed and the amount of labeled binding agent bound 

25 to the solid phase is measured. The amotmt of labeled binding agent 
botmd is directly proportional to the amount of protein in the sample. 

Western blot analysis can be used to determine the presence of DC 
proteins in a sample. Electrophoresis is carried out e.g., on a tissue sample 
suspected of containing the protein. Following electrophoresis to separate 

30 the proteins, and transfer of the proteins to a suitable solid support such as 
a nitrocellulose filter, the soHd support is incubated with an antibody 
reactive with the denatured protein. This antibody may be labeled, or 
alternatively may be it may be detected by subsequent incubation with a 
second labeled antibody that binds the primary antibody. 

35 The immunoassay formats described above employ labeled assay 

components. The label can be in a variety of forms. The label may be 
coupled directly or indirectly to the desired component of the assay 
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according to methods well known in the art A y/ide variety of libels may 
be used. The component may be labeled by any brie of seyeral methpdsi 
Traditionally a radioactive label incorporating %, T25i. SSg^ 14q or 32p 
used. Non-radioactive labels include ligailds which bind to labeled 
5 antibodies, fluorophores, chemiluminescent agents/ enzymes, arid 

antibodies which can serve as specific binding pair members for a labeled 
protein. The choice of label depends on sensitivity required, ease of 
conjugation with the compound, stability requirements, and available 
instrumentation. For a review of various labeling or signal producing 

10 systems which may be used, see US. Patent No. 4,391,904, which is 
incorporated herein by reference. 

Antibodies reactive with a particular protein can also be measured 
by a variety of immunoassay methods. For reviews of immunological and 
immunoassay procediu-es applicable to the measurement of antibodies by 

15 immunoassay techniques, see, e.g., Stites and Terr (eds.) Basic and Qinical 
Immunology (7th ed.) supra; Maggio (ed.) Enzyme Immimoassav. supra; 
and Harlow and Lane Antibodies. A Laboratory Manual, supra. 

A variety of different immimoassay formats, separation techniques, 
and labels can be also be used similar to those described above for the 

2 0 measurement of specific proteins. 

VL Purified DC proteins 

Primate, e.g., human, DCMPl nucleotide and amino acid sequences 
are provided in SEQ ID NO: 1 and 2. Rodent, e.g., mouse, DCMPl 

25 nucleotide and amino acid seqeunces are provided in SEQ ID NO: 7 and 8. 
Primate, e.g., human, DCMP2 nucleotide and amino acid sequences are 
provided in SEQ ID NO: 3 and 4. Another variant is described in SEQ ID 
NO: 9 and 10. Similar primate hepatic asialoglycyprotein sequences are 
provided in SEQ ID NO: 5 and 6. The peptide sequences allow preparation 

30 of peptides to generate antibodies to recognize such segments, and allow 
preparation of oligonucleotides which encode such sequences. 

Vn. Physical Variants 

This invention also encompasses proteins or peptides having 
35 substantial amino acid sequence similarity with an amino acid sequence of 
a SEQ ID NO: 2 or 8 or selected portions of SEQ ID NO: 4 or 10. Variants 
exhibiting substitutions, e.g., 20 or fewer, preferably 10 or fewer, and more 
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preferably 5 or fewer substjtutidiis, are also enabled- Where the? 
substitutions are conservative substitutions, the variants will share 
immunogenic or antigenic similarity or cross-reactiyity with a 
corresponding natural sequence protein. Natural variants include 
5 individual^ allelic, polymorphic, strain, or species variants. 

Amino acid sequence similarity, or sequence identi^, is determined 
by optimizing residue matches, if necessary, by introducing gaps as 
required. This changes when considering conservative substitutions as 
matches. Conservative substitutions typically include substitutions within 

10 the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic 
acid, glutamic acid; asparagiae, glutamine; serine, threonine; lysine, 
arginine; and phenylalanine, tyrosine. Homologous amino acid sequences 
include natural allelic and interspecies variations in each respective protein 
sequence. Typical homologous proteins or peptides will have from 50- 

15 100% similarity (if gaps can be introduced), to 75-100% similarity (if 

conservative substitutions are included) with the amino acid sequence of 
the relevant DC protein Identity measures will be at least about 50%, 
generally at least 60%, more generally at least 65%, usually at least 70%, 
more usually at least 75%, preferably at least 80%, and more preferably at 

20 least 80%, and in particularly preferred embodiments, at least 85% or more. 
See also Needleham, et al. (1970) I MoL BioL 48:443-453; Sankoff, et al. 
(1983) Time Warps, String Edits, and Macromolecules: The Theory and 
Practice of Sequence Comparison Chapter One, Addison- Wesley, Reading, 
MA; and software packages from IntelliGenetics, Mountain View, CA; and 

25 the University of Wisconsin Genetics Computer Group (GCG), Madison, 
WL 

For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence 
comparison algorithm, test and reference sequences are input into a 

3 0 computer, subsequence coordinates are designated, if necessary, and 

sequence algorithm program parameters are designated. The sequence 
comparison algorithm then calculates the percent sequence identity for the 
test sequence(s) relative to the reference sequence, based on the designated 
program parameters. 

3 5 Optical alignment of sequences for comparison can be conducted, 

e.g., by the local homology algorithm of Smith and Waterman (1981) Adv> 
Appl. Math. 2:482, by the homology alignment algorithm of Needlman and 
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W^insch (1970): TvMbL Biol. 48:443, by the search for siiiiaariiy metfioci of 
Pearson and lipman (1988) Proc. Natfl Acad, ScL USA 85:2444. by 
computerized implemientetions of thes0 algoritiims (GAP, BESTFIT, 
FASTA, and TFAStA in the Wisconsin Genetics Software Package, 
s; Genetics Computer Group, 575 Science Df Madison, WI), or by visual 
inspection (see generally Ausubel et aL, supra). 

One example of a useful algorithm is PILEUP. PILEUP creates a 
multiple sequence alignment from a group of related sequences using 
progressive, pairwise ahgnments to show relationship and percent 

i 0 sequence identity. It also plots a tree or dendogram showing the clustering 
relatipnships used to create the alignment PILEUP uses a simplification of 
the progressive alignment method of Feng and Doolitfle (1987) L MoL EvoL 
35:351-360. The method used is similar to the method described by 
Higgins and Sharp (1989) CABIOS 5:151-153. The program can align up to 

15 300 sequences, each of a maximum length of 5,000 nucleotides or amino 
acids. The multiple alignment procedure begins with the pairwise 
alignment of the two most similar sequences, producing a cluster of two 
aligned sequences. This cluster is then aligned to the next most related 
sequence or cluster of aligned sequences. Two clusters of sequences are 

20 aligned by a simple extension of the pairwise alignment of two individual 
sequences. The final alignment is achieved by a series of progressive, 
pairwise alignments. The program is run by designating specific sequences 
and their amino acid or nucleotide coordinates for regions of sequence 
comparison and by designating the program parameters. For example, a 

25 reference sequence can be compared to other test sequences to determine 
the percent sequence identity relationship using the following parameters: 
default gap weight (3.00), default gap length weight (0.10), and weighted 
end gaps. 

Another example of algorithm that is suitable for determining 
3 0 percent sequence identity and sequence similarity is the BLAST algorithm, 
which is described Altschul, et al. (1990) J. Mol. Biol. 215:403-410. Software 
for performing BLAST analyses is pubhcly available through the National 
Center for Biotechnology Information (http:www.ncbi.nlm.nih.gov/). This 
algorithm involves first identilying high scoring sequence pairs (HSPs) by 
3 5 identifying short words of length W in the queiy sequence, which either 

match or satisfy some positive-valued threshold score T when aUgned with 
a word of the same length in a database sequence. T is referred to as the 
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li^eighborKood word score threshold (Altschuly et aL> Supra), These initial 
neighborhood word hits act as seeds iFor initiating searches to iuidlonger 
HSPs containing them. The word hits are then extended in both directions^ 
along each sequence for as far as the cumulative alignment $core caft be 
5 increased. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its 
maximum achieved value; the cumulative score goes to zero or below, due 
to the accumulation of one or more negative-scoring residue alignments; or 
Ihe end of either sequence is reached. The BLAST algorithm parameters W, 

10 T, and X determine the sensitivity and speed of the alignment The BLAST 
program uses as defaults a wordlength (W) of 11, the BLOSUM62 scoring 
matiix (see Henikoff and Henikoff (1989) Proc. Nafl Acad. ScL USA 
89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a 
comparison of both strands. 

15 In addition to calculating percent sequence identity, tiie BLAST 

algorithm also performs a statistical analysis of the similarity between two 
sequences (see, e.g., Karlin and Altschul (1993) Proc> Natl Acad. Sci. USA 
90:5873-5787), One measure of similarity provided by the BLAST 
algorithm is the smallest sum probability (P(N)), which provides an 

20 indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability 
in a comparison of the test nucleic acid to the reference nucleic acid is less 
than about 0.1, more preferably less than about 0.01, and most preferably 

25 less than about 0.001. 

A further indication that two nucleic acid sequences of polypeptides 
are substantially identical is that the polypeptide encoded by the first 
nucleic acid is immunologically cross reactive witii the polypeptide 
encoded by the second nucleic acid, as described below. Thus, a 

30 polypeptide is typically substantially identical to a second polypeptide, for 
example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially 
identical is that the two molecules hybridize to each other under stringent 
conditions, as described below. 

35 Nucleic acids encoding the corresponding mammalian DC proteins 

will typically hybridize to SEQ ID NO: 1 or 7, or appropriate portion of 3 
imder stringent conditions. For example^ nucleic acids encoding the 
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respective TDC proteins will typically hybridize ;to the nucleic acid of SEQ 
ID NO: 1^ 7/ 3, or 9, undar stringent hybridization conditions, while 
providing few fklse positive hybridizatibn sigtials* GenetaXfy, stringent 
conditions are selected to be about 1 0_ C lower than tfte tfiermalmelting 
5 point (Tm) for the sequence being hybridized to at a defined ionic strength 
and pH. The Tm is the temperature (under defined ionic strength and pH) 
at which 50% of the target sequence hybridizes to a perfectly matched 
probe. Typically, stringent conditions will be those in which the salt 
concentration in wash is about 0,02 molar at pH 7 and the temperature is at 

1 0 least about 50_ G. Other factors may significantly affect the stringency of 
hybridization, including, among others, base composition and size of the 
complemiehtary strands, the presence of organic solvents such as 
formamide, and the extent of base mismatching. A preferred embodiment 
will include nucleic acids which will bind to disclosed sequences in 50% 

15 formamide and 20-50 mM NaCl at 42_ C Hybridization Under stringent 
conditions should give a background of at least 2-fold over background, 
preferably at least 3-5 or more. 

An isolated DC gene DNA can be readily modified by nucleotide 
substitutions, nucleotide deletions, nucleotide insertions, and inversions of 

20 nucleotide stretches. These modifications result in novel DNA sequences 
which encode these DC antigens, their derivatives, or proteins having 
highly similar physiological, immunogenic, or antigenic activity. 

Modified sequences can be used to produce mutant antigens or to 
enhance expression. EiUianced expression may involve gene amplification, 

25 increased transcription, increased translation, and other mechanisms. Such 
mutant DC protein derivatives include predetermined or site-specific 
mutations of ttie respective protein or its fragments, "Mutant DC protein" 
encompasses a polypeptide otherwise falling within the homology 
definition of the DC protein as set forth above, but having an amino acid 

30 sequence which differs from that of the DC protein as found in nature, 

whether by way of deletion, substitution, or insertion. In particular, "site 
specific mutant DC protein" generally includes proteins having significant 
similarity with a protein having a sequence of SEQ ID NO: 2 or 8. 
Generally, the variant will share many physicochemical and biological 

35 activities, e.g., antigenic or immunogenic, with those sequences, and in 
preferred embodiments contain most or all of the disclosed sequence. 
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Similar concepts apply to. tiies^^ various DC proteins, particularly those 
foiind in various warm blooded animals> e.g., primates and mammals. 

Although site spedfic mutation sites are predetermined, mutants 
need hot be site specific. DC protein mutagenesis can be coilductEid by 
5 making amino acid insertions or deletions. Substitutions, deletions, 
insertions, or any combinations may be generated to arrive at a final 
construct. Insertions include amino- or carboxyl- terminal fusions. 
Random mutagenesis can be conducted at a target codon and the expressed 
mutants can then be screened for the desired activity. Methods for making 

10 substitution mutations at predetermined sites in DNA having a known 
sequence are well known, in the art, e.g., by M13 primer mutagenesis or 
pol)mierase chain reaction (PCR) techniques. See also> Sambrbok>. et aL 
(1989) and Ausubel, et al. (1987 and Supplements). The mutations in the 
DNA normally should not place coding sequences out of reading A'ames 

15 and preferably will not create complementary regions that could hybridize 
to produce secondary mRNA structure such as loops or hairpins. 

The present invention also provides recombinant proteins/ e.g., 
heterologous fusion proteins using segments from these proteins. A 
heterologous fusion protein is a fusion of proteins or segments which are 

20 naturally not normally fused in the same manner. Thus, the fusion product 
of an immunoglobulin with a respective DC polypeptide is a continuous 
protein molecule having sequences fused in a typical peptide linkage, 
typically made as a single translation product and exhibiting properties 
derived from each source peptide. A similar concept applies to 

25 heterologous nucleic acid sequences. 

In addition, new constructs may be made from combining similar 
functional domains from other proteins. For example, domains or other 
segments may be"swapped" between different new fusion polypeptides or 
fragments, typically with related proteins, e.g., withn the lectin or 

30 asialoglycoprotein families. Preferably, intact structural domains will be 
used, e.g., intact Ig portions. See, e.g., Cunningham, et al. (1989) Science 
243:1330-1336; and 0»Dowd, et al. (1988) T. Biol. Chem. 263:15985-15992. 
Thus, new chimeric polypeptides exhibiting new combinations of 
specificities will result from tihe functional linkage of protein-binding 

35 specificities and other functional domains. Also, alamne scanning 

mutagenesis may be applied, preferably to residues which structurally are 
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exterior to the secondary structure, which will avoid mpst of the critical 
refsidues whiGh geheirailly disrupt teiiiaiy stra 

"Derivatives" of these DC antigens include amino acid sequence 
mutants, glycpsylatioii variants, and covalent or aggregate conjugates with 
5 other chemical moieties. Covalent derivatives can be prepared by linkage 
of functionalities to groups which are found in these DC proton amino 
acid side chains or at the N- or C- termini, by means which are well known 
in the art These derivatives can include, without limitation, aliphatic 
esters or amides of the carboxyl terminus, or of residues containing- 

10 carboxyl side chains, O-acyl derivatives of hydroxy! group-containing 
residues, and N-acyl derivatives of the amino terminal amino acid or 
amino-group containing residues, e.g., lysine or arginine. Acyl groups are 
selected from the group of alkyl-moieties including C3 to C18 normal alkyi, 
thereby forming alkanoyl aroyl species. Covalent attachment to carrier 

15 proteins may be important when immunogenic moieties are haptens. 

In particxilar, glycosylation alterations are included, e.g., made by 
modifying the glycosylation patterns of a polypeptide during its synthesis 
and processing, or in furtiier processing steps. Particularly preferred 
means for accomphshing this are by exposing the polypeptide to 

20 glycosylating enzymes derived from cells which normally provide such 
processing, e.g., mammahan glycosylation enzymes. Deglycosylation 
enzymes are also contemplated. Also embraced are versions of the same 
primary amino acid sequence which have other minor modifications, 
including phosphorylated amino acid residues, e.g., phosphotyrosine, 

25 phosphoserine, or phosphothreonine, or other moieties, including ribosyl 
groups or cross-linking reagents. Also, proteins comprising substitutions 
are encompassed, which should retain substantial immunogenicity, to 
produce antibodies which recognize a protein of SEQ ID NO: 2, 4, 8, or 10. 
Typically, these proteins will contain less than 20 residue substitutions 

3 0 from the disclosed sequence, more typically less than 10 substitutions, 

preferably less than 5, and more preferably less than three. Alternatively, 
proteins which begin and end at structural domains will usually retain 
antigenicity and cross immunogenicity. 

A major group of derivatives are covalent conjugates of the DC 

35 proteins or fragments thereof with other proteins or polypeptides. These 
derivatives can be synthesized in recombinant culture such as N- or C- 
terminal fusions or by the use of agents known in the art for their 
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useftdness in cross-lmking proteins through reactive side groups; 
Preferred protein deriyati^tion sites with crpss^intdng agent5 .^e at jfiee. 
amino groups, carbohydrate moieties, and cysteine lesidues; 

Fusion polypeptides between tiiese DC protein^ and other 
5 homologous or heterologous proteins are also provided^ Heterologous 

polypeptides may be fusions between different siuf ace markers, Tesuliing 
in, eig., a hybrid protein. Likewise, heterologous fusions may be 
constructed which would exhibit a combination of properties or activities 
of the derivative proteins. Typical examples are fusions of a reporter 

10 polypeptide, e.g., luciferajse, with a segment or domain of a protein, e.g., a 
receptor-binding segment^ so that the presence or location of the fused 
protein may be easily detefrmined. See, e.g., DiiU, et ai., US. Patent No. 
4,859,609. Other gene fusion parbiers include bacterial B-galactosidase, 
trpE, Protein A, C-lactamase, alpha amylase, alcohol dehydrogenase, and 

15 yeast alpha mating factor. See, e g., Godowski, et al. (1988) Science 241:812- 
816. 

Such polypeptides may also have amino acid residues which have 
been chemically modified by phosphorylation, sulfonation, biotinylation, 
or the addition or removal of other moieties, particularly those which have 

20 molecular shapes similar to phosphate groups. In some embodiments, the 
modifications will be useful labeling reagents, or serve as purification 
targets, e.g., affinity ligands. 

This invention also contemplates the use of derivatives of these DC 
proteins other than variations in amino acid sequence or glycosylation. 

25 Such derivatives may involve covalent or aggregative association with 
chemical moieties. These derivatives generally fall into tfie three classes: 
(1) salts, (2) side chain and terminal residue covalent modifications, and (3) 
adsorption complexes, for example with cell membranes. Such covalent or 
aggregative derivatives are useful as immunogens, as reagents in 

30 immunoassays, or in purification methods such as for affinity purification 
of ligands or other binding ligands. For example, a DC protein antigen can 
be immobilized by covalent bonding to a solid support such as cyanogen 
bromide-activated Sepharose, by methods which are well known in the art, 
or adsorbed onto polyolefin surfaces, with or without glutaraldehyde 

3 5 cross-linking, for use in the assay or purification of anti-DC protein 

antibodies. The DC proteins can also be labeled with a detectable group, 
e.g„ radioiodinated by ttie chloramine T procedure, covalently bound to 
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rare earth dielates, or conjugated to another fluorescent moiety for use in 
di)|gn(a«tic assays. Purifltation df the^ DC proteins may be ejected by 
immobilized antibodies. 

Isolated DC jMrotein gwies vn31 allow transjformation o£ cells lackmg 
expression of a correspoiidihg DC protein, e.g., either species iypes or cells 
which lack corresponding proteins and exhibit negative background, 
activity. Expression of transformed genes will allow isolation of 
antigeriically pure cell lines, with defined or single specie variants. This 
approach will allow for more sensitive detection and discrimination of the 
physiological effects of these DC protans. Subcellular fragments, &g., 
cytoplasts or membrane fragments, can be isolated and used. 

Vm Binding AgentDC Protein Complexes 

A DC protein fliat specifically binds to or that is specifically 
immunoreactive with an antibody generated against a defined 
immunogen, such as an immunogen consisting of tiie amino acid sequence 
of SEQ ID Na ^ 4, 8, or 10> is detennined in an immunoassay. The 
immunoassay uses a polyclonal antiserum which was raised to the protein 
of SEQ ID NO: 2, 4, 8, or 10. This antiserum is selected to have low 
crossreactivity against other members of the related families, and any such 
crossreactiviiy is removed by immunoabsorbtion prior to use in the 
immunoassay. 

In order to produce antisera for use in an immunoassay, the protein 
of SEQ ID NO: 2, 4, 8, or 10, is isolated as described herein. For example, 
recombinant protein may be produced in a mammaUan cell line. An 
inbred strain of mice such as balb/c is immunized with tfie appropriate 
protein using a standard adjuvant such as Frexmd's adjuvant, and a 
standard mouse immunization protocol (see Harlow and Lane, supra). 
Alternatively, a synthetic peptide derived from the sequences disclosed 
herein and conjugated to a carrier protein can be used an immunogen. 
Polyclonal sera are collected and titered against the immunogen protein in 
an immunoassay, e.g., a solid phase immunoassay with the immunogen 
immobilized on a solid support. Polyclonal antisera with a titer of 104 or 
greater are selected and tested for their cross reactivity against other 
related proteins, using a competitive binding immunoassay such as the one 
described in Hariow and Lane, supra, at pages 570-573. Preferably two 
different related protiens are used in this determination in conjunction with 
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a |?ven EK2^protein. For example^ with the^ lectin protein, at least two other 
family members are uisfed to absorb oiit shared epitopes. In conjunction 
witii the DGMPl family member^ two oiher members of the family are 
used. These other family members can be produced as recombinant 
proteins and isolated using standard molecular biology and protein 
chemistry techniques as described herein. 

Immunoassays in the competitive, binding format can be used for the 
crossreactivity determinations. For example, the protein of SEQ ID NO: 2 
or 8 can be immobilized to a solid support. Proteins added to the assay 
compete with the binding of the antisera to the immobilized antigen. The 
ability of the above proteins to compete wilh the binding of the antisera to 
the immobilized protein is compared to tfie protein of SEQ ID NO 2 or 8. 
The percent crossreactivity for the above proteins is calculated, using 
standard calculations. Those antisera with less than 10% crossreactivity 
with each of the proteins listed above are selected and pooled. The cross- 
reacting antibodies are then removed from the pooled antisera by 
immunoabsorbtion witii the above-listed proteins. 

The immunoabsorbed and pooled antisera are then used in a 
competitive binding immxmoassay as described above to compare a second 
protein to the immunogen protein (e.g., tiie DC protein of SEQ ID NO: 2 or 
8). In order to make this comparison, the two proteins are each assayed at 
a wide range of concentrations and the amount of each protein required to 
inhibit 50% of the binding of the antisera to the immobilized protein is 
determined. If the amoimt of the second protein required is less than twice 
the amount of the protein of SEQ ID NO: 2 or 8 that is required, then ttie 
second protein is said to specifically bind to an antibody generated to the 
immunogen. 

It is understood that DC proteins are likely a family of homologous 
proteins that comprise two or more genes. For a particular gene product, 
such as the human Ig family member protein, the invention encompasses 
not only the amino acid sequences disclosed herein, but also to other 
proteins that are allelic, polymorphic, non-allelic, or species variants. It 
also understood that the term "human DC protein" includes nonnatural 
mutations introduced by deliberate mutetion using conventional 
recombinant technology such as single site mutation, or by excising short 
sections of DNA encoding these proteins or splice variants from the gene, 
or by substituting or adding samll numbers of new amino acids. Such 
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minor alteratiom must subsfantid^^ maintain the immmoide^^^ the 
onginal mbleciUe Sifid/of i^ Thus, these alterations 

include proteim that are specifically immunoreactive with a designated 
naturally occurring respective DC profein> e.g., the human DC protein 
5: exhibiting SEQ ID NO: 4. Particular protein modifications considered 

minor would include conservative substitution of amino acids witii similar 
chemical properties, as described above for each protein family as a whole. 
By aligning a protein optimally with the protein of SEQ ID NO 2 or 8, and 
by using the conventional immunoassays described herein to determine 
1 0 immunoidentity, one can determine the protein cojmpositions of the 
invention^ 

IX. Uses 

The present invention provides reagents which will find use in 

15 diagnostic applications as described elsewhere herein, e.g., in the general 
description for developmental abnormalities, or below in the description of 
kits for diagnosis. In particular, the genes will be useful as markers for 
distinguishing cell types, including genomic aspects of cells, as well as 
mRNA and protein expression patterns. 

20 DC genes, e.g., DNA or RNA may be used as a component in a 

forensic assay. For instance, the nucleotide sequences provided may be 
labeled using, e.g., 32p qj. biotin and used to probe standard restriction 
fragment polymorphism blots, providing a measurable character to aid in 
distinguishing between individuals. Such probes may be used in well- 

25 known forensic techniques such as genetic fingerprinting. In addition, 

nucleotide probes made from DC sequences may be used in in situ assays 
to detect chromosomal abnormalities. 

Antibodies and other binding agents directed towards DC proteins 
or nucleic acids may be used to purify the corresponding DC protein 

30 molecule. As described in the Examples below, antibody purification of 

DC proteins is both possible and practicable. Antibodies and other binding 
agents may also be used in a diagnostic fashion to determine whether DC 
components are present in a tissue sample or cell population using well- 
known techniques described herein. The ability to attach a binding agent 

35 to a DC protein provides a means to diagnose disorders associated with 

expression misregulation. Antibodies and other DC protein binding agents 
may also be useful as histological or forensic markers. As described in the 
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examples belpw^ the expression of 'eajch of. ttiese proteins is limited, to 
specific tissue: ifypes. By diluting a pfobe; siieh ^ s an antibody or nucleic 
acid to the respective DC protein, it is possible to use the probe to 
distiriguish tissue and cell types in situ or in vitro, 
5 This invention also provides reagents which may exhibit significaht 

therapeutic value. The DC proteins (naturally occiirring or recombinant), 
fragments thereof, and antibodies thereto, along \yith compoimds 
identified as having binding affinity to the DC protein, may be useful in 
the treatment of conditions associated with abhdnrial physiology or 

1 0 development, including abnormal proliferation, e.g cancerous conditions, 
or degenerative conditions. Abnormal proliferation, regeneration, 
degeneration, and atrophy may be modulated by appropriate therapeutic 
treatment using the compositions provided herein. For example, a disease 
or disorder associated with abnormal expression or abnormal signaling by 

15 a DC, e.g., as an antigen presenting cell, is a target for an agonist or 

antagonist of the protein. The proteins likely play a role in regulation or 
development of hematopoietic cells, e.g., lymphoid cells, which affect 
immunological responses, e.g., antigen presentation and the resulting 
effector functions. 

20 Other abnormal developmental conditions are known in cell types 

shown to possess DC protein mRNA by northern blot analysis. See Berkow 
(ed.) The Merck Manual of Diagnosis and Therapy, Merck & Co., Rahway, 
NJ; and Thorn, et al. Harrison's Principles of hitemal Medicine, McGraw- 
Hill, NY. Developmental or functional abnormalities, e.g., of the immune 

25 system, cause significant medical abnormalities and conditions which may 
be susceptible to prevention or treatment using compositions provided 
herein. 

Recombinant DC proteins or antibodies might be purified and then 
administered to a patient These reagents can be combined for therapeutic 

30 use with additional active or inert ingredients, e.g., in conventional 
pharmaceutically acceptable carriers or diluents, e.g., immunogenic 
adjuvants, along with physiologically innocuous stabilizers and excipients. 
In particular, these may be useful in a vaccine context, where the antigen is 
combined with one of these tiierapeutic versions of agonists or antagonists. 

35 These combinations can be sterile filtered and placed into dosage forms as 
by lyophilization in dosage vials or storage in stabilized aqueous 
preparations. This invention also contemplates use of antibodies or 
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binding fragments thereof^ including forms which are not complement 
Mnding. 

Drug screening liisihg antibodies or recepibr ox fragments thereof 
can identify compounds having binding affinity to these DC proteins> 
5 including isolation of associa ted compoiiehts. Subsequent .biological assays 
can then be utilized to determine if the compound has^ intrinsic stimulatihg 
actiyity and is. therefore a blocker or antagpnisl in that it blocks the activity 
of the protein. Likewise, a compound having intrinsic stiniulating activity 
might activate the cell through the protein and is thus an agonist in that it 

10 simulates the cell. This invention further cbntemplates the, therapeutic use 
of antibodies to the proteins as antagonists. 

The quantities of reagents necessary for effective therapy will 
depend upon many different factors, including means of administration, 
target site, physiological state of the patient, and other medicants 

15 administered. Thus, treatment dosages should be titrated to optimize 

safety and efficacy. Typically, dosages used in vitro may provide useful 
guidance in the amounts useful for in situ administration of these reagents. 
Animal testing of effective doses for treatment of particular disorders will 
provide further predictive indication of human dosage. Various 

20 considerations are described, e.g., in GUman, et al. (eds.) (1990) Goodman 
and Oilman's: The Pharmacological Bases of Therapeutics (8th ed.) 
Pergamon Press; and (1990) Remington's Pharmaceutical Sciences (17th ed.) 
Mack Pubhshing Co., Easton, PA. Methods for administration are 
discussed therein and below, e.g., for oral, intravenous, intraperitoneal, or 

25 intramuscular administration, transdermal diffusion, and othere. 

Pharmaceutically acceptable carriers will include water, saline, buffers, and 
other compounds described, e.g., in the Merck Index. Merck & Co., 
Rahway, NJ. Dosage ranges would ordinarily be expected to be in 
amounts lower than 1 mM concentrations, typically less than about 10 ^M 

30 concentrations, usually less than about 100 nM, preferably less than about 
10 pM (picomolar), and most preferably less than about 1 fM (femtomolar), 
with an appropriate carrier. Slow release formulations, or a slow release 
apparatus will often be utilized for continuous administration. 

The DC proteins, fragments thereof, and antibodies to it or its 

35 fragments, antagonists, and agonists, could be administered directly to the 
host to be treated or, depending on the size of the compounds, it may be 
desirable to conjugate them to carrier proteins such as ovalbumin or serum 
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albumin prior to their administration. Therapeutic formuiatipns may be 
administered iii many conventional dosage jformulations. While it is 
possible for tiie active ingredient to be administered alone, it is preferable 
to present it as a pharmaceutical formulation. Fonmilatidns typically 
5 comprise at least one active ingredient, as defined above, together witti one 
or more acceptable carriers thereof. Each carrier should be both 
pharmaceutically and physiologically acceptable in the sense of being 
compatible with the other ingredients and not injurious to the patient 
Formulations include those suitable for oral, rectal, nasal, or parenteral 
10 (including subcutaneous, intramuscular, intravenous and intradermal) 

administration. The formulations may conveniently be presented in imit 
dosage form and may be prepared by any methods well known in the art 
of pharmacy. See, e.g., Gilman, et al. (eds.) (1990) Goodman and Gilman's: 
The Pharmacological Bases of Therapeutics (8th ed.) Pergamon Press; and 
15 (1990) Remington^s Pharmaceutical Sciences (17th ed.) Mack Publishing 
Co., Easton, PA; Avis, et al. (eds.) (1993) Pharmaceutical Dosage Forms: 
Parenteral Medications Dekker, NY; Lieberman, et al. (eds.) (1990) 
Pharmaceu tical Dosgae Forms: Tablets Dekker, NY; and Lieberman, et al. 
(eds.) (1990) Pharmaceutical Dosage Forms: Disperse Systems Dekker, NY. 
2 0 The therapy of this invention may be combined with or used in association 
with other chemotherapeutic or chemopreventive agents. 

Both the naturally occurring and the recombinant form of the DC 
proteins of this invention are particularly useful in kits and assay methods 
which are capable of screening compounds for binding activity to the 
25 proteins. Several methods of automating assays have been developed in 

recent years so as to permit screening of tens of thousands of compounds in 
a short period. See, e.g., Fodor, et al. (1991) Science 251:767-773, and other 
descriptions of chemical diversity libraries, which describe meaxvs for 
testing of binding affinity by a plurality of compounds. The development 
30 of suitable assays can be greatly facilitated by the availability of large 

amounts of purified, e.g., soluble versions of, DC protein as provided by 
this invention. 

For example, antagonists can often be found once the protein has 
been structurally defined. Testing of potential protein analogs is now 
35 possible upon the development of highly automated assay methods using a 
purified surface protein. In particular, new agonists and antagonists will 
be discovered by using screening techniques described herein. Of 
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particular importance are compoiinds foiind to have a combined binding 
affihity for iniiliiple ivelated cell surface antigens, e.g., compoxmds wMch 
can serve as antagonists for species variants of a DC protein. 

Thisinventipn is particularly useful for screening compounds by 
using recombinant DC protein in a variety of ding screening techniques. 
The advantages of using a recombinant protein in screening for specific 
lijgands include: (a) improved renewable source of the protein from a 
specific source; (b) potentially greater number of antigens per cell giving 
better signal to noise ratio in assays; and (c) species variant specificity 
(theoretiically giving greater biological and disease specificity). 

One method of drug screening utilizes eukaryotic or prokaryotic 
host cells which are stably transformed with recombinant DNA molecules 
expressing a DC protein. Cells may be isolated which express that protein 
in isolation from any others. Such cells, either in viable or fixed form, can 
be used for standard surface protein binding assays. See also. Farce, et al. 
(1989) Science 246:243-247; and Owicki, et ah (1990) Proc. Naf 1 Acad. Sci. 
USA 87:4007-4011, which describe sensitive methods to detect cellular 
responses. Competitive assays are particularly useful, where the cells 
(source of DC protein) are contacted and incubated with an antibody 
having known binding affinity to the antigen, such as l^Sj.antibody, and a 
test sample whose binding affinity to the binding composition is being 
measured. The boxmd and fi-ee labeled binding compositions are then 
separated to assess the degree of protein binding. The amount of test 
compound bound is inversely proportional to the amoimt of labeled 
antibody binding to the known source. Many techniques can be used to 
separate bound from ft-ee reagent to assess the degree of binding. This 
separation step could typically involve a procedure such as adhesion to 
filters followed by washing, adhesion to plastic followed by washing, or 
centrifugation of the cell membranes. Viable cells could also be used to 
screen for the effects of drugs on these DC protein mediated functions, e.g., 
antigen presentation or helper function. 

Another method utilizes membranes from transformed eukaryotic or 
prokaryotic host cells as the source of a DC protein. These cells are stably 
transformed with DNA vectors directing the expression of the sppropriate 
protein, e.g., an engineered membrane bound form. Essentially, the 
membranes would be prepared from the cells and used in binding assays 
such as the competitive assay set forth above. 
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Sjill anpthei; approaich i$ tp use splujbilized, unptirified or 
solubilized, purijfied DC protein from transformed ettk^iiyotic or 
prpkatyptic host cells, Ws allows jfor a "mplecular" binding assay with flie 
advantages of increased specifidty, the ability to automate, and high drug 

5 test throughput 

Another technique for drug screening inyolves an approach which 
provides high throughput screening for compovuids having siiitable 
binding affinity to the respective DC protein and is described in detail in 
Geysen> European Patent Application 84/03564, published on September 

d 13; 1984. First, large numbers of different small peptide test compounds 
are ^nthesized on a solid substrate, e.g., plastic pins or some other 
appropriate surface, see Fodor, et al., supra. Then all the pins are reacted 
with solubilized, unpurified or solubilized, purified DC protein, and 
washed. The next step involves detecting bound reagent, e.g.> antibody. 

5 One means for determining which sites interact with specific other 

proteins is a physical structure determination, e.g., x-ray crystallography or 
2 dimensional NMR techniques. These will provide guidance as to which 
amino acid residues form molecular contact regions. For a detailed 
description of protein structural determination, see, e.g., Blundell and 

[) Johnson (1976) Protein Crvstallographv Academic Press, NY. 

X. Kits 

This invention also contemplates use of these DC proteins, 
fragments thereof, peptides, and their fusion products in a variety of 

5 diagnostic kits and methods for detecting the presence of a DC protein or 
message. Typically the kit will have a compartment containing either a 
defined DC peptide or gene segment or a reagent which recognizes one or 
ttie other, e.g., antibodies. 

A kit for determining the binding affinity of a test compound to the 

) respective DC protein would typically comprise a test compound; a labeled 
compound, for example an antibody having known binding affinity for the 
protein; a source of the DC protein (naturally occurring or recombinant); 
and a means for separating bound from free labeled compound, such as a 
soUd phase for immobilizing the DC protein. Once compounds are 

; screened, those having suitable binding affinity to the protein can be 
evaluated in suitable biological assays, as are well known in the art, to 
determine whether they act as agonists or antagonists to regulate DC 
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iiinction. The availability of reGombinant DG polypeptides also provide 
weW defined standards for calibrating such assays. 

A preferred kit for determining the concentration of; for example^ a 
DC protein in a sample would typically comprise a labeled compound, e.g., 
5 antibody, having known binding affinity for the DC protein, a source of 
DC protein (naturally occurring or recombinant) and a means for 
separatirig the bound from free labeled compound, for example, a isolid 
phase for immobihzing the DC protein. Compartments containing reagent5> 
and instructions, will normally be provided. 

^0 Antibodies, including antigen binding fragments, specific for the 

respective DC or its fragments are useful in diagnostic applications to 
detect the presence of elevated levels of the protein and/or its fragments. 
Such diagnostic assays can employ lysates, live eells, fixed cells, 
immunofiuorescence, cell cultures, body fluids> and further can involve tiie 

15 detection of antigens in serum, or the like. Diagnostic assays may be 

homogeneous (without a separation step between free reagent and antigen- 
DC protein complex) or heterogeneous (witii a separation step). Various 
commercial assays exist, such as radioimmunoassay (RIA), enzyme-linked 
immimosorbentassay (ELISA), en2yme immunoassay (EIA), en2yme- 

20 multiplied immunoassay technique (EMU), substrate-labeled fluorescent 

immunoassay (SLFIA), and the like. For example, unlabeled antibodies can 
be employed by using a second antibody which is labeled and which 
recognizes the antibody to the DC protein or to a particular fragment 
ttiereof. Similar assays have also been extensively discussed in the 

25 literature. See, e.g., Harlow and Lane (1988) Antibodies: A Laboratory 

Manual CSH Press, NY; Chan (ed.) (1987) Immunoassay: A Practical Guide 
Academic Press, Orlando, FL; Price and Newman (eds.) (1991) Principles 
and Practice of Immunoassav Stockton Press, NY; and Ngo (ed.) (1988) 
Nonisotopic Immunoassay Plenum Press, NY. In particular, the reagents 

30 may be useful for diagnosing DC populations in biological samples, either 
to detect an excess or deficiency of DC in a sample. The assay may be 
directed to histological analysis of a biopsy, or evaluation of DC numbers 
in a blood or tissue sample. 

Anti-idiotypic antibodies may have similar use to diagnose presence 

35 of antibodies against a DC protein, as such may be diagnostic of various 
abnormal states. For example, overproduction of the DC protein may 
result in various immunological reactions which may be diagnostic of 
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abnormal physibiogical states^ psuiiailafly in prpliferatiye cell conditions 
su^zh as caiicei: or at>nonnal;d^^ 

Frequentiy/ the reagents for diagnostic assays are: supplied in kits> 
so as to optimize the sen$itivity of tfie assay. For the subject invention, 
5 depending iipori the nature of the assay, the protocol, and the label, either 
labeled or unlabeled antibody or receptor, or labeled DC protein is 
provided. This is usually in conjunctioh with other additives, such as 
buffers, stabiUzers, materials necessary for signal production such as 
substrates for, enzymes, and the like. Preferably, the kit will also contain 

10 instructidiis for proper use and disposal of the contents after use* Typically 
the kit has compartments for each useful reagent Desirably, the reagents 
are provided, as a dry lyopMlized powder, where the reagents may be 
reconstituted in an aqueous medium providing appropriate concentrations 
of reagents for performing the assay., 

15 Many of the aforementioned constituents of the drug screening and 

the diagnostic assays may be used without modification or may be 
modified in a variety of ways. For example, labeling may be achieved by 
covalentiy or non-covalentiy joining a moiety which directiy or indirectty 
provides a detectable signal. In many of these assays, the protein, test 

20 compound, DC protein, or antibodies thereto can be labeled either directly 
or indirectiy. Possibilities for direct labeling include label groups: 
radiolabels such as l^^I, enzymes (U.S. Pat No. 3,645,090) such as 
peroxidase and alkaline phosphatase, and fluorescent labels (U.S. Pat No. 
3,940,475) capable of monitoring the change in fluorescence intensity, 

2 5 wavelength shift, or fluorescence polarization. Possibilities for indirect 
labeling include biotinylation of one constituent foUowed by binding to 
avidin coupled to one of the above label groups. 

There are also numerous methods of separating the bound from the 
free protein, or alternatively the bound from the free test compound. The 

30 DC protein can be immobilized on various matrices followed by washing. 
Suitable matrices include plastic such as an ELBA plate, filters, and beads. 
Methods of immobilizing the DC protein to a matrix include, without 
limitation, direct adhesion to plastic, use of a capture antibody, chemical 
coupling, and biotin-avidin. The last step in this approach involves the 

35 precipitation of protein/ antibody complex by one of several methods 
including those utilizing, e.g., an organic solvent such as polyethylene 
glycol or a salt such as ammonium sulfate. Other suitable separation 
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techniques include, wiOiput Itmitatipn, the fluorescein antibody 
magnetizable particle method described in Ratti^/ et al. (1984) Glin. Chemv 
30:1457-1461, and the double antibody magnetic particle separation as 
described in U.S. Pat No. 4,659,678, 
5 Methods for linking proteins or their fragments to the various labels 

have been extensively reported in the literature and do not require detailed 
discussion here. Many of the techniques involve the use of activated 
carboxyl groups either through the use of carbodiimide or active esters to 
form peptide bonds, the formation of thioethers by reaction of a mercapto 

10 group with an activated halogen such as chloroacetyl, or an activated olefin 
such as maleimide, for linkage, or the like. Fusion proteins will also find 
use in these applications. 

Another diagnostic aspect of this invention involves use of 
oligonucleotide or pol}aiucleotide sequences taken from the sequence of a 

15 respective DC protein. These sequences can be used as probes for detecting 
levels of the message in samples from patients suspected of having an 
abnormal condition, e.g., cancer or immune problem. The preparation of 
both RNA and DNA nucleotide sequences, the labeling of the sequences, 
and the preferred size of the sequences has received ample description and 

20 discussion in the literature. Normally an oligonucleotide probe should 
have at least about 14 nucleotides, usually at least about 18 nucleotides, 
and the polynucleotide probes may be up to several kilobases. Various 
labels may be employed, most commonly radionuclides, particularly 32p. 
However, other techniques may also be employed, such as using biotin 

25 modified nucleotides for introduction into a polynucleotide. The biotin 
then serves as the site for binding to avidin or antibodies, which may be 
labeled with a wide variety of labels, such as radionuclides, fluorophores, 
enzymes, or the like. Alternatively, antibodies may be employed which 
can recognize specific duplexes, including DNA duplexes, RNA duplexes, 

30 DNA-RNA hybrid duplexes, or DNA-protein duplexes. The antibodies in 
turn may be labeled and the assay carried out where the duplex is bound to 
a surface, so that upon the formation of duplex on the surface, the presence 
of antibody bound to the duplex can be detected. The use of probes to the 
novel anti-sense RNA may be carried out in any conventional techniques 

35 such as nucleic acid hybridization, plus and minus screening, 

recombinational probing, hybrid released translation (HRT), and hybrid 
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arrested translation (HART). This also indudies ampUHcatipti te^ 
swch as polymerase chain reaction (PCR). 

Diagnostic hits which also test for ti\e qiiaUtative or quantitative 
presence of other markets are also contemplated. Diagnosis or prognosis 
5 may depend on the combination of multiple indications used as markers. 
Thus, kits may test for combinations of markers. See, e-g., Viallet; et aL 
(1989) Proeress in Growth Factor Res. 1:89-97. 

XI. Binding Partner Isolation 

10 Having isolated one member of a binding partner of a specific 

interaction, methods exist for isolating the counter-partner. See, Gearing, et 
aL (1989) EMBO T, 8:3667-3676, For example, means to label a DC surface 
protein without interfering with the binding to its receptor can be 
determined. For example, an affinity label can be fused to either the 

15 amino- or carboxyl-terminus of the ligand. An expression library can be 
screened for specific binding to the DC protein, e.g., by cell sorting, or 
other screening to detect subpopulations which express such a binding 
component See, e.g., Ho, et al. (1993) Proc. Naf 1 Acad. Sci. USA 90:11267- 
11271. Alternatively, a panning method may be used. See, e.g.. Seed and 

20 Aruffo (1987) Proc. Natl Acad. Sci. USA 84:3365-3369. A two-hybrid 

selection system may also be applied making appropriate constructs with 
the available DC protein sequences. See, e.g., Fields and Song (1989) 
Nature 340:245-246. 

Protein cross-linking techniques with label can be applied to isolate 

25 binding partners of a DC protein. This would allow identification of 
proteins which specifically interact with the appropriate DC protein. 

The broad scope of this invention is best imderstood with reference 
to the following examples, which are not intended to limit the invention to 
specific embodiments. 

30 

EXAMPLES 

I. General Methods 

Many of the standard methods below are described or referenced, 
e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual 
35 Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY; Sambrook, 
et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed.) Vols. 1-3, 
CSH Press, NY; Ausubel, et al.. Biology Greene Publishing Associates, 
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Brooklyn, NY; or Ausubd, et aL (1987 arid Supplemerits> CUiTent Protocols 
in Molecular Biolopry Wfley/Greene, NY; Innis, et al. (eds.) (1990) PGR 
Protocols: A Gtude to Methods and Applications Academic Press, MY, 

Methods for protein purification include such methods as 
ammonium sulfate precipitation, column chromatography, electrophoresis, 
centrifugation, crystallization, and others. See, e.g., Ausubel, et^aL (1987 
and periodic supplements); Deutscher (1990) "Guide to Protein 
Purification," Methods in Enzvmologv vol 182, and other volumes in this 
series; Cbligan, et al. (1996 and periodic Supplements) Current Protocols in 
Protein Science Wiley/Greenp. NY; and manufacturer's literature on use of 
profein purification products, e.g., Pharmiacia, Pistata>yay, NJ, 6r Bio-Rad, 
Richmond^ CA. Combination with recombinant techniques allow iPusion to 
appropriate segments, e.g., to a FLAG sequence or to equival^t wWch c 
be fused via a protease-removable sequence. See, e.g,> Hochuli (l989)i 
Chemische Industrie 12:69-70: Hochuli (1990) Turification of Recombinant 
Proteins with Metal Chelate Absorbent" in Setiow (ed.) Genetic 
Eneineerin g. Principle and Methods 12:87-98, Plenum Press, NY; and 
Crowe, et al. (1992) QIAexpress: The High Level Expression & Protein 
Purification System QUIAGEN, Inc., Chatsworth, CA. 

Methods for determining immunological function are described, 
e.g., in Coligan, et aL (1992 and periodic Supplements) Current Protocols in 
Immunology Wilev/Greene. NY. See also, e.g., Paul (ed.) (1993) 
Fundamental Immunology (3d ed.) Raven Press, N.Y. 

FACS analyses are described in Melamed, et al. (1990) Flow 
Cytometry and Sorting Wiley-Liss, Inc., New York, NY; Shapiro (1988) 
Practical Flow Cytometry Liss, New York, NY; and Robinson, et aL (1993) 
Handbook of Flow Cytometry Methods Wiley-Iiss, New York, NY. 

n. Generation of dendritic cells 

Human CD34+ cells were obtained as follows. See, e.g., Caux, et aL 
(1995) pages 1-5 in Banchereau and Schmitt Dendritic Cells in Fundamental 
and Clinical Immunology Plenum Press, NY. Peripheral or cord blood 
cells, sometimes CD34+ selected, were cultured in the presence of Stem 
Cell Factor (SCF), GM-CSF, and TNF-a in endotoxin fi-ee RPMI 1640 
medium (GIBCO, Grand Island, NY) supplemented with 10% (v/v) heat- 
inactivated fetal bovine serum (FBS; Flow Laboratories, Irvine, CA), 10 mM 
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.HEPES^ 2 inM L-glutamine, 5 X lOr^ M 2-mercaptoethanoi> penicillin (100 - 
mg/ml). This is referred to as complete medium. 

GP34+ cells were seeded for expansion in 2S to 75 cm^ flasks 
(Coming, NY) at 2 x 104 cells/ml. Optimal conditions were maintained by 
5 splitting these cultures at day 5 and 1 0 with medium containing fresh GM- 
CSF and TNF-a (cell concentration: 1-3 x 10^ cells/ml). In certain cases, 
cells were FACS sorted for CDla expression at about day 6. 

In certain situatioriis, cells were routinely collected after 12 days of 
culture, eventually adherent cells were recovered using a 5 mM EDTA 

10 solution. In other situations, the CDla+ cells were activated by 

resuspension in complete meditmi at 5 x 10^ cells/ml and activated for the 
appropriate time (e.g., 1 or 6 h) with 1 mg/ml phorbol 12^inyristate 13^ 
acetate (PMA, Sigma) and 100 ng/ml ionomycin (Cialbiochem, La Jolla, 
CA). these cells were expanded for another 6 days, and RNA isolated for 

15 cDNA library preparation. 

in. RNA isolation and library construction 

Total RNA is isolated using, e.g., the guanidine thiocyanate/CsQ 

gradient procedure as described by Chirgwin, et al. (1978) Biochem. 
20 18:5294-5299. 

Alternatively, poly(A)+ RNA is isolated using the OLIGOTEX 

mRNA isolation kit (QIAGEN). Double stranded cDNA are generated 

using, e.g., the SUPERSCRIPT plasmid system (Gibco BRL, Gaithersburg, 

MD) for cDNA synthesis and plasmid cloning. The resulting double 
25 stranded cDNA is unidirectionally cloned, e.g., into pSportl and 

b-ansfected by electroporation into ELECTROMAX DHIOBTM Cells (Gibco 

BRL, Gaithersburg, MD). 

IV. Sequencing 

30 DNA isolated from randomly picked clones, or after subtractive 

hybridization using unactivated cells, were subjected to nucleotide 
sequence analysis using standard techniques. A Taq DiDeoxy Terminator 
cycle sequencing kit (Applied Biosystems, Foster City, CA) can be used. 
The labeled DNA fragments are separated using a DNA sequencing gel of 

35 an appropriate automated sequencer. Alternatively, the isolated clone is 
sequenced as described, e.g., in Maniatis, et al. (1982) Molecular Cloning. 
A Laborato ry Manual. Cold Spring Harbor Laboratory, Cold Spring 
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Harbor Press; Sambrook, et aL (1989) Molecular aoning: A Laboratory 
Manual. (2d ed.), vols 1-3, CSH Press, NY; Ausubel, et al.. Biology. Greene 
Publishing Associates, Brooklyn, NY; or Ausubel et aL (1987 and 
Supplements) Current Protocols in Molecular Biology^ Greene/Wiley, New 
5 York Chemical sfequehciiig methods are also available, e.g., using Maxami 
and Gilbert sequencing techniques. 

V. Recombinant DC gene construct 

Poly(A)"*' RNA is isolated from appropriate cell populations, e.g., 
10 using the FastTrack mRNA kit (Invitrogen, San Diego, CA). Samples are 
electrophoresed, e.g., in a 1% agarose gel containing formaldehyde and 
tranisferred to a GeneScreen membrane (NEN Research Products, Boston, 
MA). Hybridization is performed, e.g., at 65_ C in 0.5 M NaHP04 pH 7.2, 

7% SDS, 1 mM EDTA, and 1% BSA (fraction V) with 32p^CTP labeled DC 
15 gene cDNA at 1Q7 cpm/mL After hybridization filters are washed three 
times at 50_ C in 0.2X SSC, 0.1% SDS, and exposed to film for 24 h. 

The recombinant gene construct may be used to generate a probe for 
detecting the message. The insert may be excised and used in the detection 
methods described above. 

20 

VL Expression of DC gene Protein in E, coli 

PCR is used to make a construct comprising the open reading frame, 
preferably in operable association with proper promoter, selection, and 
regulatory sequences. The resulting expression plasmid is transformed 

25 into an appropriate, e.g., the Topp5, E. coli strain (Stratagene, La JoUa, CA). 
Ampicillin resistant (50 fig/ ml) transformants are grown in Luria Brotti 
(Gibco) at 37_ C until the optical density at 550 nm is 0.7. Recombinant 
protein is induced with 0.4 mM isopropyl-bD-thiogalacto-pyranoside 
(Sigma, St. Louis, MO) and incubation of the cells continued at 20_ C for a 

30 further 18 hours. Cells from a 1 Liter culture are harvested by 

centrifugation and resuspended, e.g., in 200 ml of ice cold 30% sucrose, 50 
mM Tris HCl pH 8.0, 1 mM ethylenediaminetetraacetic acid. After 10 min 
on ice, ice cold water is added to a total volume of 2 liters. After 20 min on 
ice, cells are removed by centrifugation and the supernatant is clarified by 

35 filtration via a 5 nM Millipak 60 (Millipore Corp., Bedford, MA). 

The recombinant protein is purified via standard purification 
methods, e.g., various ion echange chromatography methods. 
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Jmihunoaffimty meihods usmg aintibodies described below can also be 
used. Affinity methods may be Where an epitope tag is engineered 
into an expression construct 

5 Vn. Mapping of human E)C genes 

DNA isolation, restriction enzyme digestion, agarose gel 
electrophoresis. Southern blot transfer and hybridization are peif pnned 
according to standard techniques. See Jenkins, et al. (1982) T- ViroL 
43:26-36, Blots may be prepared with Hybond-N nylon membrane 
10 (Amersham). The probe is labeled with 32p.dCTP; washing is done to a 
final stringency, e.g., of O.IX SSC, 0.1% SDS, 65_ C 

Alternatively, a BIOS Laboratories (New Haven, CT) mouse somatic 
cell hybrid panel may be. combined with PCR metiiods. See Fan, et al. 
(1996) Immimoeenetics 44:97-103. 
15 Chromosomal localization with a Stanford G3 panel gave as closest 

marker SHGC-12041, with a lod of 7.7. This marker, which is the gene 
coding for M130 antigen, is localized to chromosome 12pl3. This 
localization is host to a niunber of genes encoding receptors of the C-type 
lectin family, notably CD69, and the NK receptor fomily. 

20 

Vin. Analysis of individual variation 

From the distribution data, an abundant easily accessible cell type is 
selected for sampling from individuals. Using PCR techniques, a large 
population of individuals are analysed for this gene. cDNA or otiier PCR 
25 methods are used to sequence the corresponding gene in the different 
individuals, and their sequences are compared. This indicates both the 
extent of divergence among racial or other populations, as well as 
determining which residues are likely to be modifiable without dramatic 
effects on function. 

30 

IX. Preparation of Antibodies 

Recombinant DC proteins are generated by expression in E. coli as 
shown above, and tested for biological activity. Alternatively, natural 
protein sources may be used with purification methods made available. 
35 Antibody reagents may be used in immunopiu*ification, or to track 
separation methods. Active or denatured proteins may be used for 
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imihunizatiori of appropriate mammals for either polycldngd seriun 
productioiv or for monoGlonal antibody prc^duction. 

X. kolation of counterpart primate or rodent DC genes 
.5 Human cDNA dones encoding these genes are used as probes, or to 

design PCR primers to find counterparts in various primate species, e.g., 
chimpanzees. 

Bioinfoimatics searches of the EST database (GenBaidc dbEST) 
using the predicted polypeptide sequence of DCMPl (tblastn algorithm) 

10 revealed mouse clones encoding a protein homologous to primate DCMPl . 
Four clones corresponding to this sequence were seen: AA387662 Ko mouse 
embiyo 11 5dpc; AA170532 mouse spleen; AA475012 mouse mammary 
gland; and AA423158 mouse mammary gland. One of these, AA170532, 
estimated to be a full length clone by sequence analysis was selected and 

15 DNA sequenced. This clone contained features similar to DCMPl. The full 
length clone is 1418 bp, excluding the poly-A sequence and contains a 5' 
UTR of 278 bp. As for hDCMPl, the putative start codon is not contained 
witiiin a consensus Kozak region, but this codon is preceeded by an 
upstream stop codon. The 5* UTR contains sequences similar to rapid 

20 degradation signals, including three consensus ATTTA sites. A potential 
polyadenylation sequence is seen. The predicted polypeptide is about 238 
residues in length and codes for a type II membrane protein with an ITIM 
and a C-type lectin domain. Three potential N-glycosylation sites are seen. 
Alignments of this and the human protein show about 54% identity, 65% 

25 homology over the whole sequence. Notably, the ITIM domains are highly 
conserved (13 out of 15 residues are identical). Of interest is the conserved 
membrane-proximal glutamine motif (FQKYSQLLE), and the cysteine 
residue potentially implicated in disulphide bridge formation. Equally the 
C-type lectin domains show blocks of conservation, including the EPS 

30 motif. Differences seen between hDCMPl and the human hepatic lectins 
are retained in the mouse sequence, notably the replacement of tryptophan 
at position 119 and 163; a glutamine at position 177; and serine instead of 
tryptophan at position 228. It thus appears that this clone is the mouse 
homologue of hDCMPl. 

35 

XL Use of reagents to analyze ceU populations 
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Detection of the level of dendritic cells present in a a sample is' 
important for diagnoisis of aberrant disease cohditibhs. For example, an 
increase in the number of dendritic cells in a tissue or the lymph system 
can be indicative of ittie presence 0f a DC hyperplasia, or tissue or graft 
5 rejection* A low DC poptdation can indicate an abnormal reaction to, e.g., 
a bacterial or viral infection, which may require the appropriate treat to 
normalize the DC response. 

FACS analysis using a labeled binding agent specific for a cell 
surface DC proteiiv see, e g., Melamed, et al. (1990) Flow Cytometry and 
10 Sorting Wiley-Liss, Inc., New York, NY; Shapiro (1988) Practical Flow 

Cytometry Liss, Niew York, NY; and Robinson, et aL (1993) Handbook of 
Flow Cytometry Methods Wiley-Liss, New York, NY, is used in 
determining the number of DCs present in a cell mixture, e.g., PBMCs, 
adherent cells^ etc. The binding agent is also used for histological analysis 
15 of tissue samplies, either fresh or fixed^ to analyze infiltration of DC. 

Diverse cell populations may alsQ be evaluated, either in a cell destructive 
assay, or in certain assays where cells retain viability. 

Analysis of the presence of soluble intracellular molecules is 
performed, e.g., with a fluorescent binding agent specific for a DC as 
2 0 described in Openshaw, et al. (1995) T.Exp. Med. 182:1357-1367. 
Alternatively, tissue or cell fixation methods may be used. 

Levels of DC transcripts are quantitated, e.g., using semiquantitative 
PCR as described in Murphy, et al. (1993) L Immunol. Methods 162:211- 
223. Primers are designed such that genomic DNA is not detected. 

25 

Xn. Expression distribution 

Analysis of the entire DCMPl cDNA sequence in a sequence 
database revealed an expression pattern restricted to a limited number of 
libraries. The greatest number of sequences (ten) were detected in 

30 Dendritic Cell Ubraries, four sequences in a library of osteoclastoma cells, 

and single sequences from libraries of macrophages generated in vitro from 
monocytes, LPS activated neutrophils, chondrosarcoma, colon cancer, T- 
cell lymphoma, skin tumor and chronic synovitis. In the GenBank dbEST, 
four clones were detected: AA418441, subtracted library; AA446401, total 

35 fetus; AA677149, fetal liver spleen; C01555, Human Gene Signature; 
AA380065, Skin tumor. 
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Analysis of DCMPl expression by JRT-PCR over a number of 
different cell lines and freshly isolated cells showed that expression of 
DCMPl is not detected in TFl (Myeloid precursor), Jurkat (a T cell line), 
GHA (kidney carcinoma), MRC5 (fetal lung fibroblasts), JY (B celUine), 
5 U937 (myelo-mottbcytic lym{>hbma cell line), biit is restrict to 

haematopoeitic cells. In freshly isolated cells, expression is seen in DC, 
both non-activated and activated; granulocytes activated; PBL, bQth:noii- 
activated and activated; and a low level of expression is seen in monoqrtes 
activated; and B cells activated. All. activated samples were pools of cells 

10 treated with PMA/iohomycin for 1 arid 6 hours. 

Additional analysis showed that expression of DCMPl varied with 
the activation state of the cell. RT-PCR was also used to detect the 
expression of DCMPl under different activation states. B cells isolated 
from tonsillar tissue were treated with PMA/ionomycin for 1 or 6 hours or 

15 by coculhire with CD40L-expressing L cells for 3, 12, and 24 hours. mRNA 
was detected in non-activated cells. This expression could be lost within 1 
hour for PMA/ionomydn treatment and after 3 houis of CD40L treatment 
In contrast, no expression could be detected in T cells, even after anti- 
CD3/anti-CD28 incubation. 

20 In CD34+ derived cells expression of DCMPl was sfrong in 

macrophages derived from CD34+ progenitor cells in the presence of M- 
CSF. This expression did not appear to alter in response to 
PMA/ionomycin. In DC derived from CD34+ progenitor cells in the 
presence of GM-CSF and TNFa, the level of mRNA was seen to vary over 

25 the time course of culture, wiih greater amounts of mRNA detected at day 
12 of culhire. After 48 hours of coculture with CD40L bearing L cells, the 
expression of DCMH is lost In in vitro DC FACS sorted at day 6 for the 
presence of markers CDla/CD14 and continued in culture for a further 6 
days, more mRNA was detected in the CD14 tiwn in tfie CDla 

3 0 subpopulation. This expression was down-regulated by PMA/ionomycin 
freatment 

Jn monocytes isolated from blood, no mRNA was detected in non- 
activated cells. However, expression of DCMPl was detected after 6 hours 
of treatment with PMA/ionomycin. In DC derived from monocytes by 
3 5 treabnent with GM-CSF and IL-4, DCMPl expression was upregulated. 

This expression could not be altered by treatment with PMA/ionomycin, 
but could be downregulated by coculhire with CD40L expressing L cells. 
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In this case DCMPl mRNA expression was totally lost l>y 24 hours of 
culture. Expression of the human protein was cinfirmed using antibody 
detection methods. 

tXZMPl was expressed in subsets of DC isolated ex Vivjo, DC 
5 subsets isolated from blood or :from tonsillar tissue were characterized by 
the presence or absence of the integrin GDllc. Larson iand Springer (1990) 
Immunol. Rev. 114:181-217, The CDllc+ subset of DC isolated from blood 
(also known as GCDC) express DCMPl. However, no mRNA is detected 
after activation via an anti-CD40L or PMA/ionomycin treatment In 

10 contrast the same subset of cells isolated from tonsillar tissue no longer 
express DCMPl. In the case of the CDllc- DC subset a low level of 
expression is observed in cells isolated from blood. This expression is 
greater in cells isolated from tonsillar tissue, but again is downregulated on 
activation via an anti-CD40 antibody or witii PMA/ionomycin treatment 

15 Langerhans cells isolated from skin express DCMPl, while ttie surrounding 
basal cells show no expression, 

Xm. Primate DCMPl 

Sequence analysis suggests these DCMPs are members of the 

20 lectin/ asialoglycoprotein superfamily of receptors. In particular, the 
heapatic and macrophage lectins have been associated with the 
internalization of proteins and peptides, which, e.g., might be important in 
the uptake and presentation of antigen by dendritic cells. The DCMPl 
contains an internalization motif (YxxV) or an ITIM-like motif (IxYxxV; 

25 residues 5-10 of SEQ ID NO: 2; a more extended motif runs from residues 1 
to 24), This suggests that the protein may be a dendritic cell version of the 
family of Inhibitory Receptors (KIR; LIR, etc.), which send a negative signal 
to inhibit cell function. 

The putative open reading frame commences at about nucleotide 

30 242. This potential start codon is not in a consensus Kozak sequence, but 
since it is not preceeded by an alternative ATG and a stop codon exists at 
upstream position 200, it is predicted that this is the start of the encoded 
protein. A polypeptide of about 237 amino acids was predicted from this 
sequence. No signal peptide was detected, but a putative transmembrane 

3 5 sequence extends from positions about 386 to 443. This clone encodes a 
type n membrane protein with a C-type lectin domain. The 3' UTR 
contains a number of potential rapid degradation signals, including three 
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repeats of the consensus sequence ATTTA. No signal peptide was 
defected, but putiative transmembra^ extend from positions 45 

to 62, or alternatively, 386 to 443. This clone encodes a type II membrane 
protein with a C-type lectin domain, 
5 The polypeptide predicted from the sequence analysis has a 49 

amino acid intracellular domain which includes a tyrosine-based motif 
cenhred at residue 7. YXX(L/ V) motifis of this nature have been shown to 
act as internalization motifs in the case of the hepatic lectins and CD23 (Fee 
Rlla). This type of domain has been shown to act as activation (TTAM) or 

10 inhibitory (ITIM) motifs in molecules such as Ly49, NKG2A, and the KIR 
family of immunoglobulin-like molecules. Inhibition is mediated by the 
recruibnent of SHP2/SHIP phosphatases to the consensus domain 
(I/V)XYXX(L/V). The first 15 amino acids of DCMPl show conservation to 
the extended ITIM domain, and it seems likely that inhibition of cell 

15 function is one of the attributes of DCMPl. A single potential N- 
glycosylation site is present at about position 185. 

Comparison of the amino acid sequence of the C-type lectin domain 
of DCMPl with other proteins containing C-type lectin domains showed 
that DCMPl has the greatest homology to tiie hepatic lectins and the 

20 macrophage lectin (see Table 1). The conserved cysteine residues of the C- 
type lectin fold are clearly conserved across the members of this family, 
however a number of distijigishing features can be seen. Like the hepatic 
lectins, DCMPl has a double cysteine motif at the start of the lectin domain. 
The function of this supplementary cysteine is unknown as there is 

2 5 apparently no other cysteine in the lectin domain that may form a 

disulphide bridge with this residue. It is possible that this residue may be 
involved in intermolecular disulphide bridge formation, although there is 
another cysteine in DCMPl at position 91 which probably fulfils this 
function. The N-terminal portion of the DCMPl lectin domain shows 

3 0 greatest conservation with the hepatic lectins and the macrophage lectin. 

The calcium-binding domain is conserved in DCMPl and shows greatest 
homology to CD23, including the EPS motif (residues 195-197), glutamate 
(E) at position 201 and asparagine aspartate (ND) at position 218-219. These 
motifs are noticeably absent from the NK receptors (NKGE shown here) 
35 and CD94. 

The DCMPl is a type II membrane protein with the predicted 
h-ansmembrane segment from about residues 45 to 62. It is related to the 
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family of proteins which includes asialoglycoprotein receptors, hepatic 
lectins; CD69, CD72, CD23, and NK.receptors. This protein contains an 
extracelliflar Ca dependent C-type lectin domain at the carboxy terminus 
(from about residues 104 to 237), which exhibits the motifs characteristic of 
5 sugar residue specificity. See Table 1. These proteins typically bind to 

sugar residues on glycoproteins and are implicated in the primary immune 
response. Several members of this family^ npfably CD69, the NK receptors, 
arid CD72, have been shown to transmit a signal during cellular activation 
events including proliferation and the induction of specific genes. 

10 DCMPl, like the mouse C-type lectin KIR receptor, Ly 49, contains 

an intemalisation motif with extended homology to the group of inhibitory 
receptors (ITIM domain, see recent reviews by Vivier and Daeron (1997) 
Immunology Today 18:286-291; and Katz and Austen (1997) T Immunol. 
158:5065-5070). These receptors, either Immunoglobulin superfamUy 

15 (IgSF) members or G-type lectins, bransmit a negative signal via SH2- 

domain containing phosphatases, e.g., SHIP, SHP-1, and SHP-2. Evidence 
suggests that these receptors associate with other activation receptors in 
order to block activation signals. Evidence also suggests that the Hgand for 
tiiis type of molecule is an IgSF molecule. Examples of this are the MHC 

20 class I molecules (recognised by the CD94/NK receptors and Ly-49) and 
the FcgR (CD23; which recognises IgG). 

The cysteine residue 91 is likely to be involved in disulfide linkage 
to another polypeptide, perhaps a homo or heterodimer. 

PGR analysis indicates that the gene is expressed in activated 

25 dendritic cells and non-activated dendritic cells. Detectable signals were 
not found in any of TFl (hematopoietic cell line), Jurkat (T cell line), MRG5 
(lung fibroblast sarcoma cell Hne), JY (B cell tine), U937 (pre-monocyte ceU 
line), or GHA (carcinoma cell line) cells. Positive signals were detected in 
freshly isolated activated or non-activated PBLs, and granulocytes, but 

3 0 only weak signals from freshly isolated T cells, B cells, NK cells, or 
monocytes. 

Sequence analysis indicates expression of the gene in samples 
characterized as dendritic cells, activated neutrophils, macrophages 
(activated with GM-CSF), osteoclastoma, skin tumor, T-cell lymphoma, 
35 colon cancer, chronic synovitis, and chrondrosarcoma. 

XIV. Rodent counterpart DGMPl 
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Table 2 shows sequence of roderit cbunterpart seque^^ 

Table 2: The sequence, shows homology to two BSTs of mouse, W33446 
(see iSBQ ID NO: 11) aiid AA17d532 (see SEQ ID NO,: 8) which code for 
5 the; mouse counterpart of DCMPi (see SEQ ID NO: 8) > 

hPCMPl l!n'SEITyAE\mFKNEPKSSGIOTASSJ^ 

W33446 - — - . - - - ... _ . _ . . 

1 7053 2 MASBITYAEVKFKNBSNSIJriTSgSPAAPREKPTPnT 

1.0 mDOMPl -MASEitYAEVKFIdteSNSIi^^ 



hDCMPl LIPEIiLLMSFFIAFVIFFQBnrSQL^ 

W33446 • - -E-raCiKEIOTTELECTKWi^ 

15 170532 MliLT.T.TJjAIITXVAFiryFQKySQl^EBKKAAKNIM 

mDCMPi MLLLLIJJUTFLVAFIIYFQKYSQLLEBI^^ PME 



hDCMPl ETAWSGCPKNWKSFSSNCYFISTB - - SASWQDSEKDGARMEAHLLVINTQ 
20 W3 3446 DKVWSCCPKDWKPFGSYCYFTSTD - LVASWNESKENCFHMGAHLWIHSQ 

mDCMPl DKVWSCCPKDWI^FGSHCYLVPTVSSSASWI«CSEENCSRMGy^ 



hDCMPl EEQDFIFQNLQEESAYFVGLSDPEGQIWWQWVDQTPYNESSTFWHPRBPS 
25 W33446 EEQ 

mDCMPl EEQDFITGILDTHAAYFIGLWD - TGHRQWQWVDQTPYEESITFWHNGEPS 



hDCMPl DPNERCWIiNFR - KSPKRWGWNDVNCLGPQRSVCBMMKIHL 
30 mDCMPl SGNEKCarilYRWICr- -GWGWNDISCSLKQKSVCPMKKINL 



XV. Primate DCMP2 

DCMP2, a putative asialoglycoprotein receptor, is a type n 

35 transmembrane protein. In its extracellular region, DCMP2 features a 

single carbohydrate recognition domain (CRD), characteristic of the C-type 
(Ca++ dependent) family of lectins (see Drickamer and Taylor (1993) Ann. 
Rev. Cell BioL 9:237-264. DCMP2 displays considerable homology with the 
two genes (HI and H2) encoding the subunits of the human hepatic 

40 asialoglycoprotein-receptor. Stockert (1995) Physiol. Revs, 75:591-609. 

These hepatic receptors represent the prototype of the type U C-type lectin 
family members. Liver ASGPR has binding specificity for desialylated 
glycoproteins displaying terminal galactosyl residues, and mediates their 
endocytosis into hepatocytes via the clathrin-coated pit pathway. Notably, 

45 the features associated with both these functions are conserved between the 
hepatic ASGPR and DCMP2. Thus, DCMP2 contains an intracellular motif 
including a tyrosine residue at position 5 and which is associated with 
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ligahd ehdbcytosis capacity. See Fuhrer, et al. (1991) 1 Cell BioL 114:423- 
431. in addition, the PCMF2 display a QPD (Gln-Pro-Asp) galactpse- 
recpgnitibn type sequence (Drickamer (1992) Nature 360:183^186) in its 
sugar recognition domain. 
5 Seiveral variant cDNA clones encoding the PGMP2 have been 

isolated, most likely as a consequence of alternative spilicing. Three 
variants are described hereunder: a short form, a long form, and a third 
form designated DCMP2v. See,SEQ ID NO: 4 and 10; Table 1. The short 
and long forms differ by the presence of a unique 27 aa insert in the 

10 exb-acellular region of the short form clone. The short foixn of the DCMP2 
exhibits 4 residue differences in the extracellular region to a recently cloned 
ASGPR obtained from human macrophages (M-ASGPR). Suzuki, et al. 
(1996) L Immunol 156:128-135. 

Relative to the DCMP21, the ASGPRm lacks the seqment corresponding to 

15 GVSELQEHTTQKAmGHCPHCPSVCVP (residues 118-144 of SEQ ID NO: 4), 
and the ASGPRm contains an insert of GEE (between residues 173 and 174 of 
SEQ ID NO: 4). The DCMP2s is identical to the DCMP21 except for the absence 
of the GVSELQEHTTQKAHLGHCPHCPSVCVP (residues 118-144 of SEQ ID 
NO: 4), and a difference in sequence at nucleotide 1064 from G to A, thereby 

20 encoding asn rather than asp. The DCMPv is similar to DCMPs, but lacks the 
sequence LLQRLRSGPCHLLLSLGLG (residues 30-48 of SEQ ID NO: 4), which 
corresponds to a significant portion of the transmembrane segment; and contains 
the insert of GEE (between residues 173 and 174 of SEQ ID NO: 4) as found in the 
ASGPRm. Regions surrounding these differences, e.g., within an epitope length, 

25 e.g., 12-17 amino acids, are of interest 

Recombinant DCMP2 long form protein is available, and m ABs have 
been generated. In addition, a murine cell line has been transfected for 
stable expression of both the long and short forms. 

The gene was originally isolated from 70% pure CDla"*" DC derived 

30 from CD34'^ hematopoietic progenitor cells cultured in GM-CSF and TNFa 
(Caux, et al. (1992) Nature 360:258-261. The clone has been inserted into a 
pSportl vector (Notl/Sall restriction sites). 

PGR analysis suggests expression of DCMP2 genes in dendritic cells, 
and perhaps very weakly in TFl (hematopoietic cell line) cells. There was 

35 not detectable signal from Jurkat (T cell tine), CHA (carcinoma cell tine), 
MRC5 (lung fibroblast sarcoma cell line), or JY (B cell line). Signal was 
detected in freshly isolated non-activated or activated (PMA and 
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ibnbmycin) dendritic cells> granulocytes, and non-activated or activated 
PBL. Signal was not detected in moivpcytes, non-activated or. activated T 
Gfells, or iipn-activated or Activated B cells. 

DC-ASPGR displays considerable homology with the murine 
5 counterpart of human monocyte ASGPR (M-ASGPR), Hpmolo^ is 

striking (7- 60%) within the carbohydrate-recognition domain which confers 
specificity to murine monocyte ASGPR for galactose and N- 
acetylgalactosamine (GalNAc). Sato, et (1992) T. Biochem. 111:331-336 . 
This includes the QPD motif, also found in the HI and H2 subunits of the 

10 hepatic ASGPR. In addition, murine monocyte ASGPR has a YENL 
intemalization signal in its cytosolic domain. 

Murine M- ASGPR functions as a receptor for endocytosis of 
galactosylated glycoproteins (Ozaki, et al. (1992) I Biol. Chem. 267:9229- 
9235), and allows recognition of malignant cells by tumoricidal 

15 macrophages (Kawakami, et al. (1994) Tpn, T. Cancer Res. 85:744-749^. .In 
tills context, murine M-ASGPR was found to be expressed within lung 
metastatic nodules of mice bearing OV2944-HM-1 metastatic ovarian tumor 
cells (Imai, et al. (1995) hnmunol. 86:591-598). Of interest human M- 
ASGPR demonstrates a remarkable specificity for Tn antigen (Suzuki, et al. 

2 0 (1996) T. Immunol. 156:128-135), which bears a cluster of serine or 
threonine-linked terminal GalNAc, and is associated with human 
carcinomas (Springer (1989) Mol. hnmunoL 26:1-5; and 0rntoft, et al. (1990) 
hit I Cancer 45:666-672). 

On the basis of sequence homology, it can be predicted that DCMPs 

25 also function as an endocytic receptor for galactosylated glycoproteins. In 
addition, ligand intemalization via the mannose-receptor, another C-type 
transmembrane endocytic lectin, results in highly efficient antigen- 
presentation by DC through the MHC class H palhway. Cella, et al. (1997) 
Current O pinion Immunol. 9:10-16. By analogy, it is possible that the 

30 DCMPs play a similar role in routing internalized hgands into an antigen- 
presentation pathway. 

Thus, DCMP2 could be a potential high-efficiency target for loading 
antigens into DC for enhancing presentation to T cells in immune-based 
adjuvant therapy. This could be approached by pulsing DC in vitro either 

35 with a galactosylated form of antigen, or with anti-DCMP2 mAbs coupled 
to antigen. In vitro efficiency of presentation could be assayed by 
activation of antigen-specific T cells. This would focus on presentation of 
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tumor-asspciated antigens (TAA), due to the inherent therapeutic 
perspectives of such an approach. Of particular interest are TAA 
assdciated \yith malignant melanoma* 

In addition/ tiie specificity of human M-ASGPR for Tri antigen 
5 makes this carcinoma TAA a candidate of choice for targeting the DCMP2. 

As has been recently shown that exogenous antigen can be 
processed and presented in the MHC class I patiiway. See Porgador and 
Gilbpa (1995) T. Exp. Med. 182:255-260: Paglia, et al. (1996) 1, Exp, Med. 
183:317-^322v Specialized receptors are likely to perform such a function iri 
ID DC 

These receptors in DC may be targetted to help produce TAA- 
specific cytotoxic T cells (CTL), with significant therapeutic potential, as 
GTL appear to be implicated in the induction of tumor rejection. 

15 XVI. PCMP Internalization 

DC obtained from CD34+ progenitors cultured in GM-CSF and 
TNFa were stainied at 4** C with anti-DCMP2 mAb, or anti-CD13 as control. 
Following subsequent incubation at 37* C for a period of up to about 20 
min, cell surface boimd mAbs were analyzed. Ihtemalization was 

2 0 observed by decrease in cell surface fluorescence. 

The DCMP21 is rapidly internalized at 37^ Q but not at 4** C. About 
60% of the surface label disappeared within about 15 min. This 
demonstrates that the DCMP2 can function as an endocytic receptor, 
consistent with the presence of an internalization motif (YENF) in its 
25 intracytoplasmic domain. 

XVn. Isolation of a binding counterpart 

A DC protein can be used as a specific binding reagent, by taking 
advantage of its specificity of binding, much like an antibody would be 

3 0 used. A binding reagent is either labeled as described above, e.g., 

fluorescence or otherwise, or immobilized to a substrate for panning 
methods. 

The DC protein is used to screen for a cell line which exhibits 
binding. Standard staining techniques are used to detect or sort 
3 5 intracellular or surface expressed ligand, or surface expressing transformed 
cells are screened by panning. Screening of intracellular expression is 
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pefforaied by various staining or immimofluoreseeriee procedures See 
ailso NicMahan, et al. (1991) EMBO T. 10:2821-2832. 

For example, on day 0, precoat 2-chamber perihanox slides with.l 
5 ml per chamber of fibronectin, 10 ng/ ml in PBS, for 30 min at room 

temperature. Rinse once with PBS. Then plate COS cells at 2-3 x 10^ cells 
per chamber in 1.5 ml of growth media. Incubate ovemight at 37_ C; 

On day 1 for each sample, prepare 0.5 ml of a solution of 66 mg/ml 
DEAE-dextran, 66 mM chloroquine, and 4 mg DNA in serum free DME. 
10 For each set, a positive control is prepared, e.g., of human receptor-FLAG 
cDNA at 1 and 1/200 dilution, and a negative mock. Rinse cells with 
serum free DME. Add the DNA solution and incubate 5 hr at 37_ C 
Remove the medium and add 0.5 ml 10% DMSO in DME for 2,5 min. 
Remove aiid wash once with DME. Add 1.5 ml growth medium and 
15 incubate overnight 

On day 2, change the medium. On days 3 or 4, the cells are fixed 
and stained. Rinse the cells twice with Hank's Buffered SaUne Solution 
(HBSS) and fix in 4% paraformaldehyde (PFA)/gIucose for 5 min. Wash 
3X with HBSS. The slides may be stored at -80_ C after all liquid is 
20 removed. For each chamber, 0.5 ml incubations are performed as follows. 
Add HBSS/saponin(0.1%) with 32 ml/ml of IM NaNs for 20 min. Cells 

are then washed with HBSS/ saponin IX. Add protein or protein/ antibody 

complex to cells and incubate for 30 min. Wash cells twice with 

HBSS/ saponin. If appropriate, add first antibody for 30 min. Add second 

25 antibody, e.g.. Vector anti-mouse antibody, at 1 /200 dilution, and incubate 
for 30 min. Prepare ELKA solution, e.g.. Vector Elite ABC horseradish 
peroxidase solution, and preincubate for 30 min. Use, e.g., 1 drop of 
solution A (avidin) and 1 drop solution B (biotin) per 2.5 ml 
HBSS/saponin. Wash cells twice with HBSS/saponin. Add ABC HRP 

3 0 solution and incubate for 30 min. Wash cells twice with HBSS, second 

wash for 2 min, which closes cells. Then add Vector diaminobenzoic acid 
(DAB) for 5 to 10 min. Use 2 drops of buffer plus 4 drops DAB plus 2 
drops of H2O2 per 5 ml of glass distilled water. Carefully remove chamber 

and rinse slide in water. Air dry for a few minutes, then add 1 drop of 
3 5 Crystal Mount and a cover slip. Bake for 5 min at 85-90_ C. 
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Alternatively, other monocyte protein specific binding reagents are 
lisfed to affinity purify or sort put ceUs expresising a receptor. See, e.g., 
Sambrook/ et al* or Ausubet et al. 

Another strategy is to screen for a membrane bound, receptor by 
5 paniiihg. The r^eptof cDNA is constructed as descri The 
ligand can be immobilized and used to immobilize expressing cells. 
Immobilization may be achieved by use of appropriate antibodies which 
recognize, e.g,> a FLAG sequence of a monocyte protein fusion construct^ or 
by use of antibodies raised against the first antibodies. Recursive cycles of 
10 selection and amplification lead to enrichment of appropriate clones and 
eventual isolation of ligand expressing clones. 

Phage expri^ion libraries can be screened by monocyte protein* 
Appropriate label techniques, e.g., anti-FLAG antibodies, will allow 
specific labeling of appropriate clones. 

15 

Many modifications and variations of this invention can be made 
without departing froin its spirit and scope, as will be apparent to those 
skilled in the art The specific embodiments described herein are offered 
by way of example only, and the invention is to be limited only by the 
20 terms of the appended claims, along with the full scope of equivalents to 
which such claims are entitled. 
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WHAT BCLAIMIDIS: 

1.. A substantially pure or recombinant DCMPl polypeptide exhibitirig 
5 at least about 85% sequence identib|^ to SEQ ID NQ: 2 or 8. 

2. A substantially pure or recombinant DCMF2 polypeptide^ 
comprising:. 

a) a polypeptide sdected from: 

1) Gly Val Ser Glu Leu Gin Glu His Thr Thr Gin Lys Ala His 
Leu Gly His Cys Pro His Cys Pro Ser Val Gys Val Pro 
(residues 118-144 of SEQ ID NO: 4); 

2) Gin Val Ala Thr Leu Asn Asn. Asn Ala Ser Thr Glu Gly 
Thr Cys Cys (residues 166-181 of SEQ ID NO: 4); or 

3) Trp Lys Pro Gly Gin Pro Asp Asn Trp Gin Gly His Gly 
Leu Gly (residues 263-277 of SEQ ID NO: 4); or 

b) sequence exhibiting both: 

20 1) at least 17 contiguous amino adds from DCMP2v as 

described in SEQ ID NO: 10; and 
2) a lack of a segment of at least 12 contiguous amino acids 
from FKNGPLPLQS LLQRLRWGPC HLLLWLGLGL 
LLLVnC (residues 20-56 of SEQ ID NO: 4). 

25 

3. A fusion protein comprising the polypeptide of claim 1 or 2. 

4. A binding compoxmd which specifically binds to the polypeptide of 
30 claim 1 or Z 

5. The binding compound of claim 4 which is an antibody or antibody 
fragment 

35 6. A nucleic acid encoding the polypeptide of claim 1 or 2. 

7. An expression vector comprising the nucleic acid of claim 6. 

8. A host cell comprising the vector of claim 7. 

40 



10 
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'9. A< process for r^combinatly producing a polypeptide eomprising 
culturing the host cell of claim 8 under conditioiis in which tiie polypeptide 
is expressed. 
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SEQUENCE SUBMISSION 

SEQ ID NO: 1 is primate DCMPl C-lectin family gene nucleotide sequence* 

SEQ ID NO: 2 is primate DCMPl lectin family gene polypeptide sequence. 

SEQ ID NO: 3. is primate DCMP2 C-lisctin family gene nucleotide sequence. 

SEQ ID NO: 4 is primate DCMP2 C-lectin family gene polypeptide sequence. 

SEQ ID NO: 5 is primate hepatic ASGPRhl polypeptide sequence. 

SEQ ID NO: 6 is primate hepatic .ASGPRh2 polypeptide sequence. 

SEQ ID NO: 7 is rodent DGMPl G~lectin family gene nucleotide sequence. 

SEQ ID NO: 8 is rodieht DCMPl C-^lectin family gene polypeptide sequence^ 

SEQ ID NO: 9 is variant primate DCMP2 nucleotide sequence. 

SEQ ID NO: 10 is variant primate DCMP2 polypeptide sequence. 

SEQ ID NO: 11 is peptide sequence from W33446 rodent EST. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 



(A) 
(B) 
(C) 
(D) 
(E) 
(F) 
(G) 
(H) 



NAME: 

STREET : 

CITY: 

STATE: 

COUNTRY: 

ZIP: 

TELEPHONE: 
TELEFAX: 



Schering Corporation 

2000 galloping Hill Road 

Kenilworth 

New Jersey 

U.S.A. 

07033 

(908) 298-5056 
(908) 298-^5388 



(ii) TITLE OF INVENTION: Mammalian Membrane Protein Genes; 
Related Reagents 



(iii) NUMBER OF SEQUENCES: 11 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Power Macintosh 

(C) OPERATING SYSTEM: 8.1.0 

(D) SOFTWARE: Microsoft Word 6.0.1 



(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/053,080 

(B) FILING DATE: 09-JUL-1997 



(2) INFORMATION FOR SEQ ID N0:1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) .NAME /.KEY ; CDS, 

(B) LOCATION: 242. .952 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TTCTCACTAT ACTGGTCCTG AGGAAAGGGC TTCTGTGAAC TGGGGTTTTT AGTTTTTATT 50 

GTGGTTCTTA GTTCTGATGA GACCCCTCTT GAGGATATGT GCCTATCTGG TGCCTCTGCT 120:' 

CTCCACTAGT TGAGTGAAAG GAAGGAGGTA ATTTACCACC ATGTTTGGTT CCTGTTTATA 18ff 

AGATGTTTTA A6AAAGATTT GAAACAGATT TTCTQAAGAA AGCAGAAGCT CtCTTCCCAT 240^ 

T ATG ACT TOG GAA ATC ACT TAT GCT GAA GTG AGG TTC AAA AAT GAA 286 
Met Thr Ser Glu He Thr Tyr Ala Glji Val Arg Phe Lys Asn Glu 
15 10 15 

TTC AAG TCC TCA GGC ATC AAC ACA GCC TCT TCT GCA GCT TCC AAG GAG 334 
Phe Lys Ser Ser Gly He Asn Thr Ala Ser Ser Ala Ala Ser Lys Glu 
20 25 30 

AGG ACT GCC CCT CTC AAA AGT AAT ACC GGA TTC CCC AAG CTG CTT TGT 382 
Arg Thr Ala Pro Leu Lys Ser Asn Thr Gly Phe Pro Lys Leu Leu Cys 
35 40 45 

GCC TCA CTG TTG ATA TTT TTC CTG CTA TTG GCA ATC TCA TTC TTT ATT 430 
Ala Ser Leu Leu He Phe Phe Leu Leu Leu Ala He Ser Phe Phe He 
50 55 60 

GCT TTT GTC ATT TTC TTT CAA AAA TAT TCT CAG CTT CTT GAA AAA AAG . 478 
Ala Phe Val He Phe Phe Gin Lys Tyr Ser Gin Leu Leu Glu Lys Lys 
65 70 75 

ACT ACA AAA GAG CTG GTT CAT ACA ACA TTG GAG TGT GTG AAA AAA AAT 526 
Thr Thr Lys Glu Leu Val His Thr Thr Leu Glu Cys Val Lys Lys Asn 
80 85 90 95 

ATG CCC GTG GAA GAG ACA GCC TGG AGC TGT TGC CCA AAG AAT TGG AAG 574 
Met Pro Val Glu Glu Thr Ala Trp Ser Cys Cys Pro Lys Asn Trp Lys 
100 105 110 

TCA TTT AGT TCC AAC TGC TAC TTT ATT TCT ACT GAA TCA GCA TCT TGG 622 
Ser Phe Ser Ser Asn Cys Tyr Phe He Ser Thr. Glu Ser Ala Ser Trp 
115 120 125 

CAA GAC AGT GAG AAG GAC TGT GCT AGA ATG GAG GCT CAC CTG CTG GTG 670 
Gin Asp Ser Glu Lys Asp Cys Ala Arg Met Glu Ala His Leu Leu Val 
130 135 140 

ATA AAC ACT CAA GAA GAG CAG GAT TTC ATC TTC CAG AAT CTG CAA GAA 718 
He Asn Thr Gin Glu Glu Gin Asp Phe He Phe Gin Asn Leu Gin Glu 
145 150 155 

GAA TCT GCT TAT TTT GTG GGG CTC TCA GAT CCA GAA GGT CAG CGA CAT 766 
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Glu Ser Ala Tyr Phe Val Gly Leu Ser Asp Pro Glu. Gly Gin Arg His 

165 170 175 

TGG CAA TGG GTT GAT CAG -ACA CCA TAG AAT GAA AGT TQC ACA T.TC: TGG 814: 
Trp Gin Trp Val Asp Glri, Thr Pro Tyr Asn Glu Ser Ser Thr Phe Trp 
180 185 190 

CAT CCA CGT GAG CCC AGT GAT CCC AAT GAG CGC TGC GTT GTG CTA AAT 862 
His Pro Arg Glu Pro Ser Asp Pro Asn. Glu Arg Cys Val Val Leu Asn 
195 200 205 

TTT CGT AAA TCA CCC AAA AGA TGG GGC TGG AAT GAT GTT AAT TGT CTT 910 
Phe Arg Lys Ser Pro Lys Arg Trp Gly Trp Asn Asp Val Asn Cys Leu 
210 215 220 

GGT CCT CAA AGG TCA GTT TGT GAG ATG ATG AAG ATC CAC TTA 952 
Gly Pro Gin Arg Ser Val Cys Glu Met Met Lys He His Leu 
225 230 235 

TGAACTGAAC ATTCTCCATG AACAGGTGGT TGGATTGGTA TCTGTCATTG TAGGGATAGA 1012 

TAATAAGCTC TTCTTATTCA TGTGTAAGGG AGGTCCATAG AATTTAGGTG GTCTGTCAAC 107^2 

TATTCTACTT ATGAGAGAAT TGGTCTGTAC AT IIO4 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Ser Glu He Thr Tyr Ala Glu Val Arg Phe Lys Asn Glu Phe 
15 10 15 

Lys Ser Ser Gly He Asn Thr Ala Ser Ser Ala Ala Ser Lys Glu Arg 
20 25 30 

Thr Ala Pro Leu Lys Ser Asn Thr Gly Phe Pro Lys Leu Leu Cys Ala 
35 40 45 

Ser Leu Leu He Phe Phe Leu Leu Leu Ala He Ser Phe Phe He Ala 
50 55 60 

Phe Val He Phe Phe Gin Lys Tyr Ser Gin Leu Leu Glu Lys Lys Thr 
^5 70 75 80 

Thr Lys Glu Leu Val His Thr Thr Leu Glu Cys Val Lys Lys Asn Met 
85 90 95 

Pro Val Glu Glu Thr Ala Trp Ser Cys Cys Pro Lys Asn Trp Lys Ser 
100 105 110 
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Phe Ser S^s:. Asn Cys Tyr Phe lie Ser^" Thr Glu Ser Ala Ser Trpi Glii 
lis 120 125 

Asp Ser Glu Lys Asp Cys Ala Arg Met Glu Ala His Leu Leu Val lie, 
±30 135 

. Asii Thr Gin Glu Glu Gin Asp Phe lie Phe Gin Asn Leu Gin Glu Glu 
1^5 Istf 155 150. 

Ser Ala Tyr Phe Val Gly Leu Ser Asp Pro Glu Gly Gin Arg His Tip 
165 170 175 

Glii Trp Val Asp Gin Thr Pro tyr Asn Glu Ser Ser thr Phe Trp His 
180 I8i5 190 

Pro Arg Glu Pro Ser Asp Pro Asn Glu Arg Cys Val Val Leu Asn Phfe 
155 200 205 

Arg Lys Ser Pro Lys Arg Trp Gly Trp Asn Asp Val Asn Cys Leu Gly 
210 215 220 

Pro Gin Arg Ser Val Cys Glu Met Met Lys He His Leu 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 257-. 1204 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 608 

(D) OTHER INFORMATION: /note= "short form lacks 
nucleotides 608-673" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 775 

(D) OTHER INFORMATION: /note= "ASGPRm (table 2) has 
sequence insert encoding GEE between nucleotides 775-776" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION; 1064 

(D) OTHER INFORMATION: /note= "nucleotide 1064 of DCMP2s 
may be A, which would encode asn rather than asp at the residue 
numbered 270" 
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(Xi) SEQUmCE DESGRIPTIdN;:; SEQ ID NO : 3 : 

GTTGAGGAGA TGGGATGTCC CAGATGATAG GGCTCGTGGG ATTTCAGACC: CAAGACtAGG * 60 

AGGACTCCAG TCACCTCTAC CCCAGCTeTC CAGGACACAG CGCTCCGAA'C TCTGAGTGAC 120 

.GTGCCAGCTC TGGTCCTTGC AGCACAACGA ACGTCGGAAT GACAGCCTCC AGACCTqCGA 18G 

.CAGGTCCACC CCAGACTGGG GGCCGGGGCT GCpTCGATTT CAGCTGTGAC AACCTCAGAG 24Q 

tCGTCTTCSGC CGAAGC ATG ACA AGG ACG TAT GAA AAC TTC GAG TAG TTG .289 
Met Thr Arg Thr Tyr Glu Asn Phe Gin Tyr Leu 
1 5 10 

GAG AAT AAG GTG :AAA GTC GAG GGG TTT AAA AAT GGG CCA CTT CCT CTC 33-7. 
Glu Asn Lys Val Lys Val Gin Gly Phe Lys Asn Gly Pro Leu Pro Leu 
15 20 25 

GAG TCC CTC GTG CAG CGT CTC CGG rrCT GGG CCC TGG CAT CTC CTG CTG 385 
Gin Ser Leu Leu Gin Arg Leu Arg Ser Gly Pro Cys His Leu Leu Leu 
30 35 40 

TCC CTG GGC CTC GGG CTG CTG CTG CTG GTC ATG ATC TGT GTG GTT GGA 433 
Ser Leu Gly Leu Gly Leu Leu Leu Leu Val lie lie Cys Val Val Gly 
45 50 55 

TTC CAA AAT TCC AAA TTT CAG AGG GAC CTG GTG ACG CTG AGA ACA GAT 481 
Phe Gin Asn Ser Lys Phe Gin Arg Asp Leu Val Thr Leu Arg Thr Asp 
60 65 70 75 

TTT AGC AAC TTC ACC TCA AAC ACT GTG GCG GAG ATC CAG GCA CTG ACT 529 
Phe Ser Asn Phe Thr Ser Asn Thr Val Ala Glu He Gin Ala Leu Thr 
80 85 90 

TCC CAG GGC AGC AGC TTG GAA GAA ACG ATA GCA TCT CTG AAA GCT GAG 577 
Ser Gin Gly Ser Ser Leu Glu Glu Thr He Ala Ser Leu Lys Ala Glu 
55 100 105 

GTG GAG GGT TTC AAG CAG GAA CGG CAG GCA GGG GTA TCT GAG CTC CAG 625 
Val Glu Gly Phe Lys Gin Glu Arg Gin Ala Gly Val Ser Glu Leu Gin 

115 120 

GAA CAC ACT ACG CAG AAG GCA CAC CTA GGC CAC TGT CCC CAC TGC CCA 673 
Glu His Thr Thr Gin Lys Ala His Leu Gly His Cys Pro His Cys Pro 
125 130 135 

TCT GTG TGT GTC CCA GTT CAT TCT GAA ATG CTC CTG CGA GTC CAG CAG 721 
Ser Val Cys Val Pro Val His Ser Glu Met Leu Leu Arg Val Gin Gin 
140 , 145 150 155 

CTG GTG CAA GAC CTG AAG AAA CTG ACC TGC CAG GTG GCT ACT CTC AAC 
Leu Val Gin Asp Leu Lys Lys Leu Thr Cys Gin Val Ala Thr Leu Asn 
1^0 165 170 

AAC AAT GGC TCC ACT GAA GGG ACC TGC TGC CCC GTC AAC TGG GTG GAG 
Asn Asn Ala Ser Thr Glu Gly Thr Cys Cys Pro Val Asn Trp Val Glu 
175 180 185 



769 
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CAC CAA GAC AQG TGC; TAG TGG TTC: TCT CUC TCT GGG ATG; TCC ¥00 GCC 865 
His Gin Asp Ser Cys Tyr Trp Phe; Sex; His Sjer Gly Met Ser Trp Ala 
190 195, 200 

GAG GCT. GAG AAG TAG TGC .GAG CTG AAG ,AAO GGC- GAG CTC. GTG GTC ATG 913 
Glu Ala Glu Lys Tyr :Gys Gin Liau: Lys Asii Ala Hiis Leu Val Val lie 
205 210 215. 

AAC TCC AGG GAG GAG CAG AA% TTT GTC GAG. AAA TAT GTA GGG TGC GCA 961 
Asn Ser Arg Glu Glu Gin Asn Phe Val Gin Lys Tyr Leu Gly Ser Ala 
220 225 230 235 

TAG ACC TGG ATG GGC CTC AGT GAG- GCT GAA GGA GCC. TGG AAG. TGG GTG 1009; 
Tyr Thr trp Met Gly Leu Ser Asp Pro Glu Gly Ala Trp Lys Trp Val 
240 245 250 

GAT GGA ACA GAC TAT GCG ACC GGC TTC CAG AAC TGG AAG CCA GGC GAG 1057 
Asp Gly Thr Asp Tyr Ala Thr Gly Phe Gin Asn Trp Lys Pro Gly Gin 
255 260 265 

CCA GAG GAG TGG GAG GGG CAC GGG CTG GGT GGA GGC GAG GAC TGT GCT 1105 
Pro Asp Asp Trp Gin Gly His Gly Leu Gly Gly Gly Glu Asp Cys Ala 
270 275 280 

CAC TTC CAT CCA GAC GGC AGG TGG AAT GAC GAC GTC TGC CAG AGG GCC 1153 
His Phe His Pro Asp Gly Arg Trp Asn Asp Asp Val Cys Gin Arg Pro 
285 290 295 

TAG CAC TGG GTC TGC GAG GCT GGC CTG GGT CAG ACC AGG CAG GAG AGT 1201 
Tyr His Trp Val Cys Glu Ala Gly Leu Gly Gin Thr Ser Gin Glu Ser 
300 305 310 315 

CAC TGAGCTGCCT TTGGTGGGAC CACCCGGCCA GAGAAATGGC GGTGGGAGGA 1254 
His 



GGACTCTTGT CACGACCTCC TCGCAAGACC GCTCTGGGAG AGAAATAAGC ACTGGGAGAT 1314 

TGGAAGCACT GCTAAGATTT TGAATTTTTT TCTCTTTAAT TTTAAAAAGA TGGTATAGTG 1374 

TTCTTAAGCT TTTATTTTTT TTCCAACTTT TGAAAGTCAA CTTCATGAAG GTATAATTTT 1434 

TACATAATAA AAATGGAGTC ATTT 145g 

(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Arg Thr Tyr Glu Asn Phe Gin Tyr Leu Glu Asn Lys Val Lys 
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i 5 16 15 

Val gin Gly Phe Lys AsiX: ply Pro Leu. Pro Leu Gin, Ser Leu Leu Gin 
20 25 30 

Arg Leu Arg Ser' Gly Prp Cys His Leu Leu Leu Ser Leu fGly Leu Gly 
3:5- 40 45 

Leu Leu Leu Leu. Val He He Cys Val Val Gly Phe Gin Asn Ser Lys 
50 55 60 

Phe Gin Arg Asp Leu Val Thr Leu Arg Thr Asp Phe Ser Asn Phe Thir 
^5 70 75. 80 

Ser Asn Thr Val Ala Glu lie Gin Ala Leu Thr Ser Gin Gly Ser Ser 
85 90 95 

Leu Glu Glu Thr lie Ala Ser Leu Lys Ala. Glu Val Glu Gly Phe Lys 
100 105 116 

Gixi Glu Arg Gin Ala Gly Val Ser Glu Leu Gin Glu His Thr Thr Gin 
115 120 125 

Lys Ala His Leu Gly His Cys Pro His Cys Pro Ser Val Cys Val Pro 
130 135 

Val His Ser Glu Met Leu Leu Arg Val Gin Gin Leu Val Gin Asp Leu 

150 155 160 

Lys Lys Leu Thr Cys Gin Val Ala Thr Leu Asn Asn Asn Ala Ser Thr 
165 170 175 

Glu Gly Thr Cys Cys Pro Val Asn Trp Val Glu His Gin Asp Ser Cys 
180 185 190 

Tyr Trp Phe Ser His Ser Gly Met Ser Trp Ala Glu Ala Glu Lys Tyr 
195 200 205 

Cys Gin Leu Lys Asn Ala His Leu Val Val He Asn Ser Arg Glu Glu 
210 215 220 

Gin Asn Phe Val Gin Lys Tyr Leu Gly Ser Ala Tyr Thr Trp Met Gly 
225 230 235 240 

Leu Ser Asp Pro Glu Gly Ala Trp Lys Trp Val Asp Gly Thr Asp Tyr 
245 250 255 

Ala Thr Gly Phe Gin Asn Trp Lys Pro Gly Gin Pro Asp Asp Trp Gin 
260 265 270 

Gly His Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe His Pro Asp 
275 280 285 

Gly Arg Trp Asn Asp Asp Val Cys Gin Arg Pro Tyr His Trp Val Cvs 
290 295 300 

Glu Ala Gly Leu Gly Gin Thr Ser Gin Glu Ser His 
305 310 315 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(1) STOUEas^CE CHAiyiCTEKIiSTi 

(A) LENGTH: 291 amino acids 

(B) TYPE:: amino acid 

(C) STRANDEDNESS : . not relevant 

(D) TOPOLOGY: linear 

(.ii) MOLECULE, TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Lys Glu Tyr Glri Asp Leu Gin His Leu Asp Asn, Glu Glu Ser 
15 10 15 

Asp. His His Gin Leu Arg Lys Gly Pro Pro Pro Pro Gin Pro Leu Leu 
20 25 30 

Gin Arg Leu Cys Ser Gly Pro Arg Leu Leu Leu Leu Ser Leu Gly Leu 
35 40 45 

Ser Leu Leu Leu Leu Val Val Val Cys Val He Gly Ser Gin Asn Ser 
50 55 60 

Gin Leu Gin Glu Glii Leu Arg Gly Leu Arg Glu Thr Phe Ser Asn Phe 

70 75 80 

Thr Ala Ser Thr Glu Ala Gin Val Lys Gly Leu Ser Thr Gin Gly Gly 
85 90 95 

Asn Val Gly Arg Lys Met Lys Ser Leu Glu Ser Gin Leu Glu Lys Gin 
100 105 110 

Gin Lys Asp Leu Ser Glu Asp His Ser Ser Leu Leu Leu His Val Lys 
115 120 125 

Gin Phe val Ser Asp Leu Arg Ser Leu Ser Cys Gin Met Ala Ala Leu 
130 135 3^4Q 

Gin Gly Asn Gly Ser Glu Arg Thr Cys Cys Pro Val Asn Trp Val Glu 

150 155 160 

His Glu Arg Ser Cys Tyr Trp Phe Ser Arg Ser Gly Lys Ala Trp Ala 
165 170 175 

Asp Ala Asp Asn Tyr Cys Arg Leu Glu Asp Ala His Leu Val Val Val 
180 185 

Thr Ser Trp Glu Glu Gin Lys Phe Val Gin His His He Gly Pro Val 
195 200 205 

Asn Thr Trp Met Gly Leu His Asp Gin Asn Gly Pro Trp Lys Trp Val 
210 215 220 
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Asp Gly Thr Asp Tyr Glu Thr Gly Phe Lys Asn Trp Arg Pr.d Glu Gin 
225 230' 235 240 

Pro Asp Asp Trp Tyr Gly ;His Gly Leu Gly Gly .Gly Glu Ase^ Cys Ala 
245 250 255 

His Phe Thr Asp Asp Gly Arg Trp Asn Asp Asp Val Cys Gin Arg Pro 
260 265 270 

Tyr Arg Trp Val Cys Giu Thr Glu Leu Asp Lys Ala Ser Gin Glu Pro 
275 280 285 

Pro Leu Leu 
290 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH.: 287 amino acids 

(B) TYPE: amino acid 

(G) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Lys Asp Phe Gin Asp He Gin Gin Leu Ser Ser Glu Glu Asn 
15 10 15 

Asp His Pro Phe His Gin Gly Pro Pro Pro Ala Gin Pro Leu Ala Gin 
20 25 30 

Arg Leu Cys Ser Met Val Cys Phe Ser Leu Leu Ala Leu Ser Phe Asn 
35 40 45 

He Leu Leu Leu Val Val He Cys Val Thr Gly Ser Gin Ser Ala Gin 
50 55 60 

Leu Gin Ala Glu Leu Arg Ser Leu Lys Glu Ala Phe Ser Asn Phe Ser 

70 75 80 

Ser Ser Thr Leu Thr Glu Val Gin Ala He Ser Thr His Gly Gly Ser 
85 90 95 

Val Gly Asp Lys He Thr Ser Leu Gly Ala Lys Leu Glu Lys Gin Gin 
100 105 110 

Gin Asp Leu Lys Ala Asp His Asp Ala Leu Leu Phe His Leu Lys His 
115 120 125 

Phe Pro Val Asp Leu Arg Phe Val Ala Cys Gin Met Glu Leu Leu His 
130 135 140 

Ser Asn Gly Ser Gin Arg Thr Cys Cys Pro Val Asn Trp Val Glu His 
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150 155 160 

Gin Gly Ser Cys Tyr Trp Phe Ser His Ser Gly Lys Ala Trp Ala .Glu 
165 170 ' iti 

Ala Glu Lys Tyr Cys Gin Leu Glu Asn Ala His Leu Val Val lie Asn 
180 185 ^ 190 

Ser Trp Glu Glu Gin Lys Phe lie Val Gin His Thr Asn Pro Phe Asn> 
195 200 205. 

Thr Trp He Gly Leu Thr Asp Ser Asp Gly Ser Trp Lys Trp Val Asp 
210 215 220 

Gly Thr Asp Tyr Arg His Asn Tyr Lys Asn Trp Ala Val Thr Gin Pro 
225 230 235 240 

Asp Asn Trp His Gly His Glu Leu Gly Gly Ser Glu Asp Cys Val Glu 
245 250 255 

Val Gin Pro Asp Gly Arg Trp Asn Asp Asp Phe Cys Leu Gin Val Tyr 
260 265 270 

Arg Trp Val Cys Glu Lys Arg Arg Asn Ala Thr Gly Glu Val Ala 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 279.. 992 

(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATION: 134 8 

(D) OTHER INFORMATION: /note= "poly-A addition motif 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTATCCCCCA CTTTGCAGTA CTTGCATATC TTGCTGAGTG GGTTTGAGGG CTACAATTCT 60 

TATTTTCTTA TGTTAAGAGG TTGCATTTCC CTTATCTCGC CCTGGTGATT CTATGCTGTG 120 

GTTTCTTGTT CTCATCTCGT TTATCCTAGT GAGACATGTC TCTTCTTTCA TACAACTGTG 180 

CAATATGACA ACTTATCACA GTGATTGGTT CTCATATACT ATAGAGCCTT AGAGAAGGAA 240 

CAAGGCTCTC TTCTGACGGA GGAAGATTTT TTCTTGAT ATG GOT TCA GAA ATC 293 
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Met Aia Ser Glu lie 
I 5 

ACT TAT GQh GAA. <3TG: AAG TTC AAG; AAT GAA .TCC AAC TCC TTG CAC AGC 341 
Thr Tyx Ala Glti Val Lys Phe Lys Asn Glu Setr Asn Ser Lew His Thr 
iO 15 20 

TAG TCA GAA TOT OCT GCA GCT CCC AGA GAG AAA OCT ATG CGT GAT CTA 389 
Tyr Ser Glu Ser Pro Ala Ala Pro Arg Glu. Lys Pro lie Arg Asp* Leu 
25 30 35 

AGA AAG CCT GGT TCC CCC TCA CTG CTT CTT ACA TCC CTG ATG CTA CTT 437 
Arg Lys Pro Gly Ser Pro Ser Leu Leu Leu Thr Ser Leu JAet Leu Leu 
40 45 50 

CTC CTG CTG CTG GCA ATC ACA TTC TTA GTT GCT TTT ATC ATT TAT TTT 485 
Leu Leu Leu Leu Ala lie Thr Phe Leu Val Ala Phe Tie lie Tyr Phe 
55 60 65 

GAA AAG TAC TCT CAA CTT CTT GAA GAA AAA AAA GGT GCA AAA AAT ATA 533 
Gin Lys. Tyr Ser ^Gln Leu Leu Glu. Glu Lys Lys Ala Ala Lys Asn lie 
70 75 80 85 

ATG CAC AAT GAA TTG AAC TGC ACA AAA AGT GTT TCA CCC ATG GAA GAC 581 
Met His Asn Glu Leu Asn Cys Thr Lys Ser Val Ser Pro Met Glu Asp 
90 95 100 

AAA GTC TGG AGC TGT TGC CCA AAG GAT TGG AGG CTA TTT GGT TCC CAC 629 
Lys Val Trp Ser Cys Cys Pro Lys Asp Trp Arg Leu Phe Gly Ser His 
105 110 115 

TGC TAC TTG GTT CCC ACA GTT TCT TCA TCA GCA TCT TGG AAC AAG AGT 677 
Cys Tyr Leu Val Pro Thr Val Ser Ser Ser Ala Ser Trp Asn Lys Ser 
120 125 130 

GAG GAG AAC TGC TCC CGC ATG GGT GCT CAT CTA GTG GTG ATC CAA AGC 725 
Glu Glu Asn Cys Ser Arg Met Gly Ala His Leu Val Val lie Gin Ser 
135 140 145 

CAG GAA GAG CAG GAT TTC ATC ACT GGG ATC TTG GAC ACT CAT GCT GCT 773 
Gin Glu Glu Gin Asp Phe lie Thr Gly He Leu Asp Thr His Ala Ala 
150 155 160 165 

TAT TTT ATA GGG TTG TGG GAT ACA GGC CAT CGG CAA TGG CAA TGG GTT 821 
Tyr Phe He Gly Leu Trp Asp Thr Gly His Arg Gin Trp Gin Trp Val 
170 175 180 

GAT CAG ACA CCA TAT GAA GAA AGT ATC ACA TTC TGG CAC AAT GGT GAG 869 
Asp Gin Thr Pro Tyr Glu Glu Ser He Thr Phe Trp His Asn Gly Glu 
185 190 195 

CCC AGC AGT GGC AAT GAA AAA TGT GCT ACA ATA ATT TAC CGT TGG AAG 917 
Pro Ser Ser Gly Asn Glu Lys Cys Ala Thr He He Tyr Arg Trp Lys 
200 205 210 



ACT GGA TGG GGC TGG AAC GAT ATC TCT TGC AGT CTT AAA CAG AAG TCA 
Thr Gly Trp Gly Trp Asn Asp He Ser Cys Ser Leu Lys Gin Lys Ser 
215 220 225 



965 
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GTT TGT CAG ATG AAG AAA- ATA AAC TTA TGAATGACTC ATTCTTCATG 1012 
Val Cys Gin Met Lys Lys lie Asn Leu 
230 235 

GGqATTGGAT TCATTGTtAT CCAACCATTA CACAGACAqC TCGGAAATTC TACAGGTTCA 1072: 

tAGAAtTVTAA GTGGGCAGCA AATGGTTATG CATACACT6G .GCCAGAtATA TGCTTCSTGCA 1132 

TTTACCCACG TACTCTGTCA TAAAATGAAG TTTCATTGAG AATTTTCTAT ATACCACAGA 1192 

GTATACAGAG TCCCTTATGG ACACACATGG AACTTTTTGC CATCTTGTTT ACTCATGCCA 1252 

TTGTATGATA GGTTCTCTTG ACCTATCTGT TTCTGTTTCT CTGTTGTTTT TTTAATGtCt 1312 

TTGGATirrAT TGACATTAAA TTGAGAAGTA AAATTATAAA TATTTAAGTG TCTGGATTGA 1372 

TAGACACAGA TATGTACTAT GAAATATAAT TAAATATTTA CTGTCC 1418 

{?) INFORMATION FOR SEQ ID NO: 8: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ser Glu He Thr Tyr Ala Glu Val Lys Phe Lys Asn Glu Ser 
1 5 10 15 

Asn Ser Leu His Thr Tyr Ser Glu Ser Pro Ala Ala Pro Arg Glu Lys 
20 25 30 

Pro He Arg Asp Leu Arg Lys Pro Gly Ser Pro Ser Leu Leu Leu Thr 
35 40 45 

Ser Leu Met Leu Leu Leu Leu Leu Leu Ala He Thr Phe Leu Val Ala 
50 55 60 

Phe He He Tyr Phe Gin Lys Tyr Ser Gin Leu Leu Glu Glu Lys Lys 
65 70 75 80 

Ala Ala Lys Asn He Met His Asn Glu Leu Asn Cys Thr Lys Ser Val 
85 90 95 

Ser Pro Met Glu Asp Lys Val Trp Ser Cys Cys Pro Lys Asp Trp Arg 
100 105 110 

Leu Phe Gly Ser His Cys Tyr Leu Val Pro Thr Val Ser Ser Ser Ala 
115 120 125 

Ser Trp Asn Lys Ser Glu Glu Asn Cys Ser Arg Met Gly Ala His Leu 
130 135 140 
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Val Vai He Glh Ser Gin Glu: Glu Gin Asp Phe lie Thr, Gly lie Leu 
145 156 155 160 

Asp Thr His Ala. Ala Tyr Phe= He Gly Leu Tn> Asp Thr Gly His Arg 

165 176 ifi 

Gin Trp Gin Trp Val Asp Gin Thr Pro Tyr Glu Glu Ser He Thr Phe 
180 185 190 

Trp. His .Asn Gly Glu Pro Ser Ser Gly Asn Glu Lyis Cys Ala Thr He 
195 200 265 

He Tyr Arg Trp Lys Thr Gly Trp Gly Trp Asn Asp He Ser Cys Ser 
210 215 220 

Leu Lys Gin liys Ser Val Cys Gin Met Lys Lys He Ash Leu 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1370 base pairs 

(B) TYPE:: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 273.. 1091 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AAAGCATGGT CTCTGTGTGT TCTAATCCCT GTTCATTCTC ATTTACTGTC CCTGGGATTT 60 

CAGATCCAAG ACCAGCAGGA CTCCAGTCAC CTCTACCCCA GCTCTCCAGG ACACAGCGCT 120 

CCCAACTCTG AGTGACGTCC CACCTCTGGT CCTTGCAGCA CAACCAACGT GGGAATCACA 180 

CCCTCCAGAC CTCCCACAGC TCCACCCCAG ACTGGGCGCC GGCCCTGCCT CCATTTCAGC 240 

TGTGACAACC TCAGAGCCGT GTTGGCCCAA GC ATG ACA AGG ACG TAT GAA AAC 293 

Met Thr Arg Thr Tyr Glu Asn 
1 5 

TTC CAG TAC TTG GAG AAT AAG GTG AAA GTC CAG GGG TTT AAA AAT GGG 341 
Phe Gin Tyr Leu Glu Asn Lys Val Lys Val Gin Gly Phe Lys Asn Gly 
10 15 20 

CCA CTT CCT CTC CAG TCC CTC CTG CTG CTG GTC ATC ATC TGT GTG GTT 389 
Pro Leu Pro Leu Gin Ser Leu Leu Leu Leu Val He He Cys Val Val 
25 30 35 

GGA TTC CAA AAT TCC AAA TTT CAG AGG GAC CTG GTG ACC CTG AGA ACA 437 
Gly Phe Gin Asn Ser Lys Phe Gin Arg Asp Leu Val Thr Leu Arg Thr 
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4P 45 50: 55: 

GAT TTT AGG AAG .TTC ACC TCA AAC ACT GTG GCG GAG ATC GAG GGA GTG 48.5' 
Asp Phe- Ser Asn, Phe Thr Ser Asn thr Val. Ala Giu lie. Gin Ala Leu 
66 65 70 

ACT TCC CAG GGC AGC AGC TTG GAA GAA ACG ATA GCA TCT CTG AAA GCT 533 
Thr Ser Gin Gly Ser Ser Lett Glu Glu Thr lie Ala Ser Ley Lys Ala' 
75 80 85 

GAG GTG GAG- GGT TTC AAG CAG GAA CGG CAG GCA GTT CAT TCT GAA ATG 581 
Glu Val Glu Gly Phe: Lys Gin Glu Arg Gin Ala Val His Ser Glu Met 
90 95 100 

CTC CTG CGA GTC GAG CAG CTG GTG CAA GAC CTG AAG AAA CTG AGC TGC 629 
Leu Leu Arg Val Gin Gin Leu Val Gin Asp Leu Lys Lys Leii thr Cys 
105 liO 115 

CAG GTG GGT ACT CTC AAC AAC AAT GGT GAG GAA GCC TCC ACT GAA GGG 677 
Gin Val Ala Thr Leu Ash Asn Asn Gly Glu Glu Ala Ser Thr Glu Gly 
120 125 130 1-35 

ACC TGC TGC CCC GTC AAC TGG GTG GAG CAC CAA GAG AGC TGC TAG TGG 725 
Thr Cys Cys Pro Val Asn Trp Val Glu His Gin Asp Ser Cys Tyr Trp 
140 145 150 

TTC TCT GAC TCT GGG ATG TCC TGG GCC GAG GCT GAG AAG TAG TGC CAG 773 
Phe Ser His Ser Gly Met Ser Trp Ala Glu Ala Glu Lys Tyr Cys Gin 
155 160 165 

CTG AAG AAC GCC CAC CTG GTG GTC ATC AAC TCC AGG GAG GAG GAG AAT 821 
Leu Lys Asn Ala His Leu Val Val He Asn Ser Arg Glu Glu Gin Asn 
170 175 180 

TTT GTC CAG AAA TAT CTA GGC TCC GCA TAC ACC TGG ATG GGC CTC AGT 869 
Phe Val Gin Lys Tyr Leu Gly Ser Ala Tyr Thr Trp Met Gly Leu Ser 
185 190 195 

GAC CCT GAA GGA GCC TGG AAG TGG GTG GAT GGA AC A GAC TAT GCG ACC 917 
Asp Pro Glu Gly Ala Trp Lys Trp Val Asp Gly Thr Asp Tyr Ala Thr 
200 205 210 215 

GGC TTC CAG AAC TGG AAG CCA GGC CAG CCA GAC GAC TGG CAG GGG CAC 965 
Gly Phe Gin Asn Trp Lys Pro Gly Gin Pro Asp Asp Trp Gin Gly His 
220 225 230 

GGG CTG GGT GGA GGC GAG GAC TGT GCT CAC TTC CAT CCA GAC GGC AGG 1013 
Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe His Pro Asp Gly Arg 
235 240 245 

TGG AAT GAC GAC GTC TGC CAG AGG CCC TAC CAC TGG GTC TGC GAG GCT 1061 
Trp Asn Asp Asp Val Cys Gin Arg Pro Tyr His Trp Val Cys Glu Ala 
250 255 260 

GGC CTG GGT CAG ACC AGC CAG GAG AGT CAC TGAGCTGCCT TTGGTGGGAC 1111 
Gly Leu Gly Gin Thr Ser Gin Glu Ser His 
265 270 
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CACCCGGCGA GAGAAATGGe GGTGGGAGGA GGACTCTTCT CACGACCTCC tCGGAAGACe. 11.71 

iSCTCTGGGAd AGAAATAAGC ACTGGGAGAT TQGAAGCACT GCTAACATTT TGAATTTTTT 1231 

TCfPGTTTAAT TTTAAAAAGA TGGTATAGTG- TtGTTAAGCT TTrATTTT'Tt TTiECAAGtTT 129"1 

TGAAAGTGAA CTTCAT6AAG GTATAATTTT TACATAATAA AAATGCACtd ATTTAAAGAG 1351 

TAAAAAAAAA AAAAAAAAA 1370 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH.: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRiPTION: SEQ ID NO: 10: 

Met Thr Arg Thr Tyr Glu Ash Phe Gin Tyr Leu Glu Asn Lys Val Lys 
15 10 15 

Val Gin Gly Phe Lys Asn Gly Pro Leu Pro Leu Gin Ser Leu Leu Leu 
20 25 30 

Leu Val lie lie Cys Val Val Gly Phe Gin Asn Ser Lys Phe Gin Arg 
35 40 45 

Asp Leu Val Thr Leu Arg Thr Asp Phe Ser Asn Phe Thr Ser Asn Thr 
50 55 60 

Val Ala Glu lie Gin Ala Leu Thr Ser Gin Gly Ser Ser Leu Glu Glu 
65 70 75 80 

Thr lie Ala Ser Leu Lys Ala Glu Val Glu Gly Phe Lys Gin Glu Arg 
85 90 95 

Gin Ala Val His Ser Glu Met Leu Leu Arg Val Gin Gin Leu Val Gin 
100 105 110 

Asp Leu Lys Lys Leu Thr Cys Gin Val Ala Thr Leu Asn Asn Asn Gly 

115 120 125 

Glu Glu Ala Ser Thr Glu Gly Thr Cys Cys Pro Val Asn Trp Val Glu 
130 135 140 

His Gin Asp Ser Cys Tyr Trp Phe Ser His Ser Gly Met Ser Trp Ala 
145 150 155 160 

Glu Ala Glu Lys Tyr Cys Gin Leu Lys Asn Ala His Leu Val Val He 
165 170 175 

Asn Ser Arg Glu Glu Gin Asn Phe Val Gin Lys Tyr Leu Gly Ser Ala 
180 185 190 
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Tyr> Thr Trp Met Gly Leu Sex Asp Pro Glu Gly Ala Trp Lys Trp Val 
195 200 205 

Asp Gly Thr Asp Tyr Ala Thr Gly Phe: Gin Asn Trp Lys Pro Gly Gin 
210 215 220 

Pro Asp Asp Trp Gin Gly His Gly Leu Gly Gly Gly Glu Asp Gys: Ala 
225 230 235 240 

His Phe His Pro Asp Gly Arg Trp Asn Asp Asp Val Cys Gin Arg Pro 
245 250 255 

Tyr His Trp Val Cys Glu Ala Gly Leu Gly Gin Thr S,er Gin Glu Ser 
260 265 270 

His 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) iSEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu Lys Met lie lie Lys Glu Leu Asn Tyr Thr Glu Leu Glu Cys Thr 
IS 10 15 

Lys Trp Ala Ser Leu Leu Glu Asp Lys Val Trp Ser Cys Cys Pro Lys 
20 25 30 

Asp Trp Lys Pro Phe Gly Ser Tyr Cys Tyr Phe Thr Ser Thr Asp Leu 
35 40 45 

Val Ala Ser Trp Asn Glu Ser Lys Glu Asn Cys Phe His Met Gly Ala 
50 55 60 

His Leu Val Val He His Ser Gin Glu Glu Gin 
65 70 75 
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